Collections Agent Proposal
This is a proposal-level document for presales discussions. It focuses on architecture logic, product value, and delivery feasibility.
The core principle is to let the Agent handle standardized communication and workflow execution within authorized and auditable boundaries, while human specialists focus on higher-value cases.
1. Solution Goals
In collections, the key is not just “being able to chat.” The real goal is to contact the right customer, at the right time, with the right strategy, through the right channel, and move the case to the right next step.
The overall system is organized into four layers:
flowchart TB
U[Channel Layer<br/>IM / call transcript / email / in-app / web] --> O[Orchestration Layer<br/>Sage Agent Runtime]
O --> P[Policy Layer<br/>routing / compliance / scripts / escalation]
O --> M[Memory & Asset Layer<br/>customer profile / history / repayment plan / KB]
O --> A[Action Layer<br/>auto reply / callback / handoff / ticket creation]
The solution targets four outcomes:
- Improve first-contact efficiency
- Reduce manual workload
- Strengthen compliance stability
- Continuously improve from real outcomes
2. Overall Architecture
The collections system uses a single main execution Agent plus multiple auxiliary Agents rather than a fully peer-to-peer multi-agent setup.
The main execution Agent is the center of the business loop. It:
- Understands the current conversation
- Decides the next action
- Calls tools to execute work
- Composes the reply
- Decides when to hand off to a human
Auxiliary Agents provide specialized capabilities:
- Compliance Agent: checks whether the reply or action crosses a boundary
- Review Agent: analyzes successful and failed conversations
- Knowledge Agent: retrieves policy, FAQ, and account explanations
- Customer Insight Agent: extracts customer state, risk tags, and communication preferences
- Skill Builder Agent: turns repeatable experience into reusable skills
flowchart LR
In[Incoming message] --> Main[Main Execution Agent]
Main --> K[Knowledge Agent]
Main --> C[Compliance Agent]
Main --> R[Review Agent]
Main --> S[Skill Builder Agent]
Main --> H[Human Handoff]
K --> Main
C --> Main
R --> Main
S --> Main
Main --> Out[Reply / Handoff / End]
Why the Main Execution Agent Stays Central
Collections is a closed business loop. The key question is not who can speak better, but who is responsible for the current case.
If multiple Agents are peer-level decision makers, three issues appear:
- Responsibility becomes diluted
- Case state can become inconsistent
- Runtime complexity grows quickly
With a centralized main Agent, the system behaves like a command center plus specialist advisors:
- The command center makes decisions and executes actions
- Advisors provide information, checks, and reusable experience
This is easier to explain, easier to govern, and easier to audit.
3. Agent Development and Runtime Model
3.0 Sage sagents Runtime Framework
` sagents ` is the runtime kernel of the system. It does not care whether the business is collections or customer service. It provides a stable execution model for models, tools, skills, memory, and audit.
The core logic is:
Session holds state, Flow holds the path, Agent makes decisions, Tool performs actions, Skill stores experience, Sandbox isolates execution, and Observability keeps the trace.
flowchart TB
Caller[External caller<br/>IM / API / batch job] --> SAgent[SAgent entry]
SAgent --> Sess[Session / SessionManager]
Sess --> Flow[FlowExecutor / AgentFlow]
Flow --> Agent[Agent instance]
Agent --> Ctx[SessionContext]
Agent --> LLM[Model layer]
Agent --> Tool[ToolManager]
Agent --> Skill[SkillManager]
Tool --> Sandbox[Sandbox / safe execution]
Skill --> Sandbox
Sess --> Obs[Observability / Trace]
Agent --> Obs
Tool --> Obs
LLM --> Obs
3.0.1 Why This Base Layer Matters
For a high-frequency business like collections, the hard part is not making a model say one sentence. The hard part is keeping the system stable over time, making the right action, and leaving the right record.
sagents solves four foundational problems:
- Unified state
- Unified execution
- Unified capability access
- Unified auditability
3.0.2 End-to-End Request Path
sequenceDiagram
participant Caller as Caller
participant SAgent as SAgent
participant SM as SessionManager
participant Sess as Session
participant Flow as FlowExecutor
participant Ag as Main Agent
participant LLM as Model
participant Tool as Tool Layer
participant Obs as Observability
Caller->>SAgent: Send message / event
SAgent->>SM: Get or create Session
SM-->>SAgent: Session
SAgent->>Sess: Bind context / model / workspace
SAgent->>Obs: Record start event
SAgent->>Flow: Run current flow
Flow->>Ag: Invoke main Agent
Ag->>LLM: Reason / generate / decide
LLM-->>Ag: Result or tool intent
opt Tool needed
Ag->>Tool: Query KB / lookup customer / send IM / handoff
Tool-->>Ag: Tool result
end
Ag->>Sess: Write back state / risk / next action
Ag->>Obs: Record decision trace
Ag-->>SAgent: Return message or action
SAgent-->>Caller: Return result
3.0.3 Runtime Principles
- The main Agent is the only decision entry point
- Tools are executed through a unified dispatch layer
- Skills capture reusable experience
- Sandbox keeps side effects controlled
- Observability makes the process explainable
3.1 Role Split
flowchart TB
M[Main Execution Agent<br/>Unified decision and execution] --> D[Conversation execution]
M --> P[Policy judgment]
M --> K[Knowledge lookup]
M --> C[Compliance check]
M --> T[Human handoff]
M --> R[Review analysis]
M --> S[Skill capture]
The main Agent is the control center. The auxiliary capabilities are plugin-like specialists.
- Main Agent: reads context, decides next steps, calls tools, composes replies, controls pacing
- Knowledge capability: provides factual, policy, and account explanations
- Compliance capability: checks boundaries before and after generation
- Handoff capability: produces a summary and transfers the case to a human specialist
- Review capability: analyzes good and bad conversations and extracts improvement points
- Skill capability: turns proven experience into reusable business skills
3.2 Runtime Mode
The runtime uses an event-driven + state-driven + tool/skill-driven model:
- Each message enters a Session
- Session stores customer state, history, risk tags, collections stage, and channel context
- The main Agent makes local decisions based on the Session
- The main Agent chooses whether to call a model, a tool, or a Skill
- Every critical action is logged
flowchart TB
In[External event / new message] --> Sess[Session]
Sess --> Ctx[SessionContext<br/>state / history / risk / stage]
Ctx --> M[Main Execution Agent]
M -->|facts needed| K[Knowledge / customer info tools]
M -->|actions needed| T[IM send / handoff / ticket tools]
M -->|experience needed| S[Skill call]
M -->|guardrails needed| C[Compliance Agent]
K --> M
T --> M
S --> M
C --> M
M --> Out[Reply / action / escalate / end]
3.2.1 Runtime Decision Logic
The main Agent first evaluates:
- Is the task simple?
- Is the context short?
- Is speed important?
- Is the output structured?
- Does the task need deeper reasoning or higher-quality generation?
When the task is simple, the context is short, and speed matters, the small model is used. When the task is complex, the context is long, or the answer needs stronger reasoning or higher-quality expression, the large model is used.
3.3 Benefits
- Faster delivery
- Easier quality control
- Better presales storytelling
- Easier multi-channel scale-out
4. Small and Large Model Orchestration
In Sage, small/large model orchestration is not a hardcoded model switch. It is a dynamic dispatch decision made by the main execution Agent based on task type.
The recommended pattern is:
- Small models handle high-frequency lightweight tasks
- Large models handle complex judgment and high-value generation
flowchart TB
In[User message / external event] --> Main[Main Execution Agent]
Main --> Route{Task routing}
Route -->|simple / short context / speed-first| Small[Small model worker]
Route -->|structured extraction / tagging / summarization| Small
Route -->|complex reasoning / negotiation / rewriting| Large[Large model worker]
Route -->|compliance / risk review| Check[Compliance Agent]
Small --> Main
Large --> Main
Check --> Main
Main --> Out[Reply / action / handoff]
4.1 How Sage Dispatches Models
Model dispatch is split into three layers:
- Routing layer: the main Agent classifies the task
- Orchestration layer: a model, prompt, and tool set are selected
- Constraint layer: format, compliance, confidence, and factual consistency are checked
This means Sage does not just switch a model name. It switches the entire execution strategy:
- Small models get shorter context, stronger structure, and fewer tools
- Large models get broader context, more freedom, and richer tool chains
- Compliance Agent reviews both outputs
flowchart LR
A[Main Execution Agent] --> B{Select execution strategy}
B --> C[Small model strategy<br/>simple tasks / short context / speed-first<br/>structure / classification / extraction]
B --> D[Large model strategy<br/>complex tasks / long context / quality-first<br/>negotiation / rewriting / deep reasoning]
C --> E[Structured output check]
D --> E
E --> F[Compliance and factual review]
F --> G[Action / reply]
4.2 Why This Fits Collections
Collections is naturally tiered:
- Most messages are standard Q&A, status checks, or simple reminders
- Fewer messages require complex negotiation, dispute handling, or escalation
The optimal strategy is therefore:
- Use small models for simple tasks
- Use small models when context is short
- Use small models when speed matters
- Use small models for structured work such as classification and extraction
- Use large models for complex negotiation, long-context reasoning, and higher-quality responses
- Let the main Agent unify the final decision
This keeps cost, speed, and quality in balance.
4.3 Execution Flow
flowchart TB
A[Receive message] --> B[Main Agent parses state]
B --> C{Task route}
C -->|standard classification| S1[Small model]
C -->|knowledge lookup| S2[Small model + retrieval]
C -->|negotiation / rewriting| L1[Large model]
C -->|high-risk case| C1[Compliance Agent]
S1 --> D[Write back to Session]
S2 --> D
L1 --> D
C1 --> D
D --> E[Main Agent output]
5. Continuous Improvement
The core idea is not that the model magically becomes smarter. The system is designed as a closed learning loop.
flowchart LR
A[Real conversations] --> B[Outcome labeling]
B --> C[Success / failure / human handoff / risk]
C --> D[Review analysis]
D --> E[Policy optimization]
D --> F[Skill distillation]
D --> G[Evaluation set updates]
E --> A
F --> A
G --> A
5.1 Review Agent
A dedicated Review Agent analyzes successful and failed conversations:
- Why did this case succeed?
- Was it the wording, the timing, or the strategy?
- Which sentence changed the customer’s response?
- Which response increased friction?
- Which case should be handed to a human earlier?
Its output is not just a summary. It produces actionable conclusions:
- reusable conversation patterns
- expressions to avoid
- escalation thresholds
- skill candidates
5.2 Learning Successful Paths
When a pattern keeps working, the system can:
- Extract the key turning points
- Convert them into a reusable template
- Add them to the regression evaluation set
flowchart TB
X[Successful real conversation] --> Y[Pattern extraction]
Y --> Z[Candidate policy / skill]
Z --> Q[Offline evaluation]
Q -->|pass| W[Grey release]
Q -->|fail| Y
W --> X
6. From Experience to AI Capability
The goal is to turn human experience into machine-callable capability assets.
6.1 Agent-Assisted Skill Authoring
A Skill Builder Agent helps business experts turn experience into structured skills:
- trigger conditions
- recommended wording
- forbidden wording
- applicable customer groups
- escalation conditions
- example dialogues
flowchart TB
A[Business expert speaks experience] --> B[Skill Builder Agent]
B --> C[Structured skill draft]
C --> D[Compliance review]
D --> E[Published as callable capability]
6.2 Skill Distillation from Successful Conversations
The system can also distill skills from real successful conversations:
- Identify high-conversion segments
- Capture key context
- Summarize trigger conditions
- Build a standard skill
- Publish it to the skill library
This turns scattered human know-how into durable system knowledge.
7. Compliance and Hallucination Control
In collections, the most important requirement is not “sounds human.” It is “is correct, allowed, and traceable.”
7.1 Three Guardrails
flowchart TB
A[Before generation] --> B[During generation]
B --> C[After generation]
A --> A1[Permission / customer / stage / blacklist / sensitive term checks]
B --> B1[Controlled prompts / tool constraints / structured output]
C --> C1[Compliance review / fact check / risk scoring / audit trail]
Guardrail 1: Before generation
- Check whether the customer may be contacted automatically
- Check whether the channel is compliant
- Check whether the current stage forbids certain phrasing
- Check whether handoff is mandatory
- Check customer tags such as dispute, complaint risk, or legal sensitivity
Guardrail 2: During generation
- Keep the model inside controlled tools and templates
- Force structured output to reduce free-form drift
- Provide enough context, but do not expose disallowed information
Guardrail 3: After generation
- Compliance Agent reviews again
- Facts are checked for consistency
- Sensitive language is detected
- If needed, the reply is rewritten, downgraded, or handed off
7.2 Engineering Controls Against Hallucination
- Retrieval first
- Structured actions first
- Confidence thresholds
- Dual-model cross-checking in critical steps
7.3 Reply Principles
The system follows these principles:
- Do not exaggerate consequences
- Do not make unverified promises
- Do not disclose unauthorized personal information
- Do not use coercive or misleading wording
- Clarify first when uncertain
- Hand off to a human when needed
8. Human Collaboration and Escalation
The Agent is not replacing humans. It is removing low-value work from humans.
8.1 When to Handoff
- Complaint or strong negative emotion
- Billing dispute
- Special negotiation request
- High-risk rule trigger
- Uncertain facts
- No progress after multiple turns
8.2 What the Handoff Contains
The handoff is not raw conversation text. It includes:
- current customer state
- recent key turns
- strategies already tried
- current risk tags
- recommended next action
- warnings and forbidden expressions
flowchart TB
A[Agent detects escalation] --> B[Generate handoff summary]
B --> C[Human specialist takes over]
C --> D[Human continues the conversation]
D --> E[Outcome fed back]
9. Tools and Skills
This section is important in presales because it shows the system is not just conceptual. It is executable.
9.1 Tool List
The tool layer is designed as a standard action set:
- IM message sending
- human handoff
- knowledge base lookup
- customer information lookup
- conversation history lookup
- ticket creation
- follow-up scheduling
- risk checks
- audit logging
9.2 Skill List
The skill layer captures reusable experience:
- Collections script skill
- first-contact skill
- negotiation skill
- unreachable-customer recovery skill
- complaint handling skill
- compliance phrasing skill
- human handoff skill
- review extraction skill
- knowledge Q&A skill
9.3 Why Skills Matter
Skills make experience durable:
- Higher consistency
- Faster replication
- Easier maintenance
- Better asset accumulation
10. Value Proposition
The value can be summarized in four sentences:
- Faster: standard scenarios are handled automatically
- Safer: compliance and risk control are built in
- Cheaper: small models handle high-frequency work
- Smarter: the system improves from real outcomes
11. Delivery Path
The delivery path is split into three phases:
Phase 1: Usable
- Connect one or two main channels
- Enable routing, auto-reply, and human handoff
- Establish basic compliance and audit logging
Phase 2: Better
- Add small/large model orchestration
- Add review Agents and skill capture
- Improve complex negotiation handling
Phase 3: Evolving
- Distill skills from real conversations
- Build evaluation and grey release loops
- Turn experience into organizational assets
12. Closing
This solution is not a “chatbot.” It is a controllable, explainable, upgradeable, and compliant collections intelligent system.
The customer sees not a single AI feature, but a business infrastructure that can keep expanding:
- entry points can connect
- the middle layer can be controlled
- policy can be tuned
- humans can take over
- experience can be accumulated
- risk can be contained
That is what makes the solution persuasive in presales.