← back to picker
SVARA
read the story
Open-source · Apache 2.0

Next-Generation
Indic Voice AI.

19.

Languages, including the under-resourced.

Hindi, Bengali, Tamil, Telugu, Marathi, Kannada, Malayalam, Punjabi, Gujarati, English, Nepali, Sanskrit, Urdu, Assamese, Odia — and Magahi, Maithili, Bhojpuri, Bodo, Dogri, where most TTS just gives up.

Coverage
400K

Downloads on HuggingFace.

Used in production by developers building accessibility tools, audiobooks, IVR systems, content localization, and consumer apps across India and the diaspora.

Trust
3B

Parameters. Llama backbone. Discrete audio tokens.

Built on Orpheus-style discrete audio tokenization, distilled from a 3B Llama backbone. Runs on a single T4. Quantized GGUF builds for laptops too.

Architecture
<200ms

Latency, end-to-end.

P50 latency sits at 187ms across all 19 languages. Comfortably under the threshold where conversation begins to feel turn-taking, not transactional. Edge-cached. WebSocket streaming.

Speed
Drop-in API

Three lines. Any language.

# pip install openai from openai import OpenAI client = OpenAI(base_url="https://api.svara.ai/v1", api_key="sk-svara-...") audio = client.audio.speech.create(model="svara-v1-indic", voice="asha", input="नमस्ते") audio.stream_to_file("hello.wav")

Start with 10,000
free characters.

Get an API key → Self-host on HuggingFace
No credit card. Apache 2.0. Free forever for the first 10K chars/month.