Platform
The Voice Pipeline
Behind Every Call.
A 5-step orchestration layer connecting AI4Bharat STT, an LLM, and IndicTTS into a single production-grade voice platform.
Architecture
Five Steps —
One Seamless Flow
Every call flows through a carefully orchestrated pipeline — fast, resilient, and fully swappable.
Call Arrives
Citizen calls via any phone network. Audio streams in real-time — no buffering.
Speech Recognised
Voice converts to text while the citizen is still speaking. First words in under 100ms.
AI Finds the Answer
Understands the question, searches the knowledge base, generates a contextual response.
Speaks the Answer
Converts text to natural voice in the caller's language. Audio starts in under 250ms.
VoicERA Orchestrates
Coordinates every step, holds conversation memory, retries on errors. Swap any component independently.
Modular by Design
Swap any component independently — speech engine, AI model, or voice output. VoicERA manages the flow. Your stack, your rules.
Built With
3–4 months of engineering → now it takes a few hours. Same infrastructure. Unlimited voice services.
Agent Platform
From Idea to Live
Voice Agent
What once took 3–4 months of engineering now takes a few hours.
Create an Agent
Name it. Write a system prompt in plain English — or any Indic language. Pick a knowledge base.
agent:name: "Kisan Helpline"prompt: "You are an agricultureassistant. Answer inMarathi."language: mr
Configure It
Pick your language, voice engine, phone number. Upload a PDF — the agent learns from it instantly.
language: Marathi (mr)voice_engine: AI4Bharat TTSphone: +91-XXXXXXXXXXknowledge_base:- pesticide_guide.pdf- crop_calendar.pdf
Go Live
Test with a real call. Deploy. Handles 24/7 calls with full transcripts, recordings, and analytics.
status: LIVEcalls_handled: 12,847avg_response: 1.8ssuccess_rate: 88.2%uptime: 99.9%
Platform Capabilities
Agent Builder
Prompt-driven, multi-turn conversations with full knowledge-base integration.
Campaign Manager
Outbound calls, CSV bulk upload, retry logic, and scheduling.
Analytics
Full call transcripts, recordings, drop-off rates, success tracking.
Observability
CPU, GPU, memory, and end-to-end latency in a single view.
What's Next
You have a domain.
What would you do
with a voice?
Farmers who don't use apps. Patients who can't read forms. Citizens who hang up on IVRs.
Deploy VoicERA
For your domain — crop helpline, health triage, grievance system, scheme eligibility. Let's talk today.
Deploy it todayFork the Codebase
Build your own vertical. The entire stack is yours — speech engine, LLM, voice output. Extend at will.
Fork on GitHubContribute
Language models, adapters, orchestration. Build the voice layer for India — together, in the open.
Start contributingIndia's voice isn't English. It's 22 languages, 140 crore people, and one phone call away from the help they need.
VoicERA makes sure someone — or something — is always listening.