Skip to content

Voice Agents

Voice agents are AI personas that make and receive phone calls, generate personalized outreach copy, and respond to emails — all using your org’s context and knowledge base. Each agent has a configurable personality, voice, phone number, and set of skills.

How It Works

  1. Go to Agents → Agents
  2. Click New and choose Agent or Twin
  3. Configure the agent’s persona, identity, and knowledge
  4. Attach skills and activate

Agents vs. Twins

TypePurposeExtra Features
AgentGeneral-purpose AI persona for calls, emails, and chatPreset voice selection
TwinDigital replica of a real personCustom voice cloning, LinkedIn profile extraction, writing style upload

Twins inherit their source agent’s persona, role, and voice — then layer on personal context from LinkedIn data and writing samples to match the real person’s tone.

Persona

The Persona tab controls how your agent communicates.

SettingWhat It Controls
RoleSDR, AE, Receptionist, CSM, GTM Engineer, or Custom
Tone / FormalityCasual (0.1) → Moderate (0.5) → Formal (0.8)
ConcisenessDetailed (0.1) → Balanced (0.5) → Direct (0.8)
AssertivenessGentle (0.1) → Balanced (0.5) → Assertive (0.8)
Persona PromptFull free-text instructions for the agent’s behavior
Outbound Call InstructionsOverride behavior specifically for outbound calls
Inbound Call InstructionsOverride behavior specifically for inbound calls
Voicemail MessagePer-agent voicemail script (falls back to org-level if empty)

Voicemail messages support template variables: {first_name}, {bot_name}, {company_name}.

Identity

The Identity tab assigns communication channels to your agent.

  • Phone Number — select from available Twilio numbers. Required to make or receive calls
  • Voice — choose a preset Cartesia voice (Michael, Sarah, David, Emma, and others) or, for twins, clone a custom voice
  • Calendar — link an appointment type for booking meetings during calls
  • Company Knowledge — toggle access to the org-wide knowledge base

Voice Cloning (Twins Only)

  1. In the agent form, switch to Custom voice mode
  2. Click Clone New Voice
  3. Select a sample script (Professional, Conversational, or Friendly)
  4. Record up to 10 seconds of audio or upload a WAV/MP3 file
  5. Click Clone Voice

Cloned voices are saved to your org and can be reused across multiple twins.

Knowledge

Agents can access two types of knowledge:

  • Company Knowledge — org-wide context including company description, services, products, pain points, value proposition, pitch, FAQs, target audience, and competitive advantage
  • Collections — up to 4 document collections per agent. Documents are chunked and vectorized for retrieval during conversations

Skills

Attach reusable AI skills to give your agent capabilities beyond conversation. Go to the Skills tab on the agent detail page and click Attach Skills.

Skills have three runtime types:

TypeWhat It Does
LLMRuns a prompt template against an AI model with configurable temperature and token limits
APICalls an external HTTP endpoint with method, headers, body template, and auth
ScriptExecutes Python, JavaScript, or TypeScript code with configurable timeout (up to 300s)

Skills can require human approval before execution. Execution history — including input, output, duration, token count, and cost — is tracked on the agent’s Activity tab.

On-Demand Content Generation

Voice agents power personalized content generation at send time across channels:

ChannelWhat Happens
EmailPer-contact personalized email generated using CDP data, campaign context, and conversation history
LinkedInPer-contact LinkedIn message generated at send time — no static templates needed
Call ScriptPer-contact call script generated with live CDP data before dialing
VoicemailDynamic voicemail message personalized per contact

On-demand generation pulls from your Global Context, campaign documents, and the contact’s enrichment data. Twin tone profiles (formality, conciseness, assertiveness) are applied to match the sender’s style.

Phone Calls

Outbound Calls

Agents can place outbound calls via Twilio or Telnyx. Each call links to a campaign and sequence step for attribution. After the call, a callback delivers the transcript, summary, disposition, sentiment, recording, and duration.

Inbound Calls

Assign a phone number to an agent to handle inbound calls automatically. The agent uses your inbound instructions, company knowledge, and attached collections to respond. If the agent can’t resolve the call, it falls back to voicemail or queues for an SDR to claim.

Testing

Click Test Digital Agent on the agent detail page to run a test before going live.

Phone Call Test

  • Outbound — enter a phone number and click Start Test Call. A live status card shows call state (Dialing → Ringing → Active → Completed) with a real-time transcript
  • Inbound — call the agent’s assigned number from your phone. The modal shows the expected greeting and instructions

Email Test

  • Respond — select a pre-built scenario (Billing Inquiry, Technical Support, Sales Inquiry, Feature Request, Service Complaint), review the incoming email, and generate a response
  • Compose — choose a scenario or write custom context, then generate a fresh email with recipient info and context

Activity & Analytics

The Activity tab on the agent detail page shows a filterable feed of all interactions:

FilterWhat It Shows
CallsInbound/outbound calls with transcript, audio player, summary, disposition, sentiment, and duration
SkillsSkill executions with status, token count, cost, input/output JSON, and errors
WorkflowsWorkflow runs with status, duration, input/output data

The Analytics tab shows aggregate metrics: total calls, success rate, average duration, and total duration.

Next Steps

  • Skills → — Create reusable AI actions for agents
  • Workflows → — Chain skills and actions into automated multi-step processes
  • Dialer → — Use agents with the parallel dialer for outbound calling
  • Sequencer → — Enroll contacts in sequences that use agent-generated content