Voice Agent
ChatbotVoice (core)
POST /dashboard/chatbot-voice/storeElevenLabs Conversational AI agent — single-platform STT+LLM+TTS
Realtime Voice Studio
AresGen's voice agents run on ElevenLabs Conversational AI, a single platform that handles speech-to-text, reasoning, and speech back. Embed via iframe, drive via REST API, train with your knowledge base.
1 provider · 8 agent personas · 3 RAG input types · iframe + REST embed
By the numbers
Backend surfaces
Voice Agent CRUD on ElevenLabs Conversational AI, Knowledge Training with rag-training, an iframe-embed widget, and a public REST API. Every surface is wired to the AresGen backend with first-party endpoints.
ChatbotVoice (core)
POST /dashboard/chatbot-voice/storeElevenLabs Conversational AI agent — single-platform STT+LLM+TTS
ChatbotVoiceTrainController
POST /dashboard/chatbot-voice/train/{file,text,url}Embedding-based RAG — file ⨯ text ⨯ URL → vector retrieval
ChatbotVoiceController::frame
GET /chatbot-voice/{uuid}/framePublic iframe, no auth — drop into any site with one tag
ChatbotVoiceEmbbedController + History
GET /api/v2/chatbot-voice/{uuid}Public conversation logging — `api/v2` for headless integrations
One platform
AresGen real-time voice runs on a single-vendor stack: ElevenLabs Conversational AI: STT + LLM + TTS (single platform). One platform owns the full seam between conversation turns, which is where end-users actually judge voice quality.
Provider: ElevenLabs Conversational AI · STT + LLM + TTS (single platform)
Capabilities
Knowledge retrieval via rag-training crosses three surfaces; public-embed via iframe-embed lives in two; conversation-log includes the REST API. The matrix below is generated from the same facts module the audit gate enforces.
| Capability | Voice Agent | Knowledge Training | Embed Widget | REST API |
|---|---|---|---|---|
| Streaming voice | ✓ | — | ✓ | — |
| Barge-in / interrupt | ✓ | — | ✓ | — |
| Knowledge retrieval | ✓ | ✓ | ✓ | — |
| Multilingual | ✓ | — | ✓ | — |
| Conversation log | ✓ | — | ✓ | ✓ |
| Public embed | — | — | ✓ | ✓ |
Agent demo
Each persona is a curated system-prompt preset, not a separate model. Tap any chip to swap the transcript below. Every opener uses the same “this is your AresGen voice agent” frame so the persona is the visible variable, not the brand.
Persona: Support Tier-1· professional
Use cases
Tier-1 deflection with sub-second turn-taking. The agent handles common questions, escalates with full transcript when a human is needed.
Pricing
Voice agents unlock on Pro and Business. Pro covers knowledge training and the embeddable iframe widget; Business adds the REST API and conversation logging. See /pricing for the full breakdown.
Pair this with
Need TTS without a live conversation loop? Generate one-shot voice clips across 32+ voices and 29 languages.
Explore VoiceoverNeed text-only chat across multiple models? Route between 12 catalog-validated models in one subscription.
Explore AI ChatFAQ
ElevenLabs Conversational AI: a single platform that handles speech-to-text, reasoning, and speech back. No second vendor sits between the user and the agent.
One provider, 8 personas, knowledge-base RAG, iframe + REST embed. Start free.