Models Overview

Choose the right model for your text-to-speech needs.

Available Models

YourVoic offers 5 TTS models, each optimized for different use cases:

ModelLatencyQualityLanguagesVoicesExpressionsCredit Cost
aura-lite Fast Good 90+ 1000+ Aura voices Yes 1 credit/char
aura-prime Medium Excellent 90+ 1000+ Aura voices Yes 1 credit/char
aura-max Slower Premium 90+ 1000+ Aura voices Yes 1 credit/char
rapid-flash Fastest Good 18 62 voices No 1 credit/char
rapid-max Fast Very Good 41 30 Aura voices No 2 credits/char
Two distinct voice sets

Aura voices (Peter, Kylie, Rahul, Deepika…) work with aura-lite, aura-prime, aura-max, and rapid-max. (Jacob, Emma, Arjun, Ananya…) are exclusive to rapid-flash. Always fetch voices for your chosen model using ?model=.

Prosody support reminder

If your workflow depends on explicit speed or pitch controls, Rapid models are the safest choice. Aura models focus on natural speech quality; their streaming path does not support pitch.

Model Details

Aura Lite

Our fastest Aura model, optimized for real-time applications.

  • Best for: Chatbots, voice assistants, real-time apps
  • Features: Expression tags, 90+ languages, 1000+ Aura voices
  • Voice set: Aura voices — fetch with ?model=aura-lite
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Quick response for real-time chat.",
        "voice": "Peter",
        "language": "en-US",
        "model": "aura-lite"
    }
)

Aura Prime

Balanced model with excellent quality and reasonable speed.

  • Best for: General purpose, e-learning, presentations
  • Features: Expression tags, enhanced prosody, 1000+ Aura voices
  • Voice set: Aura voices — fetch with ?model=aura-prime
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Professional quality for your content.",
        "voice": "Peter",
        "language": "en-US",
        "model": "aura-prime"
    }
)

Aura Max

Premium quality model for professional content.

  • Best for: Audiobooks, professional voiceovers, media production
  • Features: Highest quality, natural prosody, expression tags, 1000+ Aura voices
  • Voice set: Aura voices — fetch with ?model=aura-max
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Premium audiobook quality narration.",
        "voice": "Peter",
        "language": "en-US",
        "model": "aura-max"
    }
)

Rapid Flash

Ultra-fast model using a dedicated Neural voice engine for lowest latency applications.

  • Best for: IVR systems, notifications, high-volume processing
  • Voice set: 62 voices (Jacob, Emma, Arjun, Ananya…) — exclusively for this model
  • Limitation: Only 18 languages supported; no expression tags
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Ultra-fast response.",
        "voice": "Jacob",
        "language": "en-US",
        "model": "rapid-flash"
    }
)

Rapid Max

Fast model combining the Aura voice set with speed optimizations.

  • Best for: Balance of speed and quality across many languages
  • Features: 41 languages, 30 voices, speed and pitch control
  • Voice set: Aura voices — fetch with ?model=rapid-max
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Fast with great quality.",
        "voice": "Peter",
        "language": "en-US",
        "model": "rapid-max"
    }
)

Fetching Voices per Model

Always use ?model= when fetching the voice list to get the correct set for your chosen model:

# Aura voices for aura-prime (English names)
curl "https://yourvoic.com/api/v1/voices?model=aura-prime&language=en-US" \
  -H "X-API-Key: YOUR_API_KEY"

# Aura voices for aura-prime (Hindi names)
curl "https://yourvoic.com/api/v1/voices?model=aura-prime&language=hi" \
  -H "X-API-Key: YOUR_API_KEY"

# Neural voices for rapid-flash
curl "https://yourvoic.com/api/v1/voices?model=rapid-flash" \
  -H "X-API-Key: YOUR_API_KEY"

# Aura voices for rapid-max
curl "https://yourvoic.com/api/v1/voices?model=rapid-max&language=en-US" \
  -H "X-API-Key: YOUR_API_KEY"

Choosing the Right Model

Use CaseRecommended ModelWhy
Voice Assistantaura-liteFast response, expression support
Audiobooksaura-maxPremium quality, natural prosody
E-learningaura-primeClear speech, balanced quality
IVR / Phone Systemsrapid-flashLowest latency
Notificationsrapid-flashFast and cost-effective
Video Narrationaura-maxProfessional quality
Chatbotsaura-liteReal-time with expressions
Batch Processingrapid-maxGood speed/quality balance at volume

Models Overview (V2)

Choose the right AI model for your text-to-speech needs.

Available Models

YourVoic offers 5 TTS models, each optimized for different use cases:

ModelLatencyQualityLanguagesExpressionsControl NotesCredit Cost
aura-lite Fast Good 90+ Yes Streaming speed supported; streaming pitch unavailable. Generate controls are conservative in the playground. 1 credit/char
aura-prime Medium Excellent 90+ Yes Streaming speed supported; streaming pitch unavailable. 1 credit/char
aura-max Slower Premium 90+ Yes Best natural quality; streaming pitch unavailable. 1 credit/char
rapid-flash Fastest Good 18 No Supports speed and pitch in standard generation and pseudo-streaming. 1 credit/char
rapid-max Fast Very Good 41 No Generate supports speed and pitch; streaming supports speed only. 2 credits/char
Prosody support reminder

If your workflow depends on explicit speed or pitch controls, Rapid models are the safest choice. Aura models focus first on natural speech quality and accent guidance, and their streaming path does not support pitch.

Model Details

Aura Lite

Our fastest Aura model, optimized for real-time applications.

  • Best for: Chatbots, voice assistants, real-time apps
  • Latency: ~200ms first byte
  • Features: Expression tags, all languages, all voices
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Quick response for real-time chat.",
        "voice": "Peter",
        "model": "aura-lite"
    }
)

Aura Prime

Balanced model with excellent quality and reasonable speed.

  • Best for: General purpose, e-learning, presentations
  • Latency: ~400ms first byte
  • Features: Expression tags, enhanced prosody
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Professional quality for your content.",
        "voice": "Peter",
        "model": "aura-prime"
    }
)

Aura Max

Premium quality model for professional content.

  • Best for: Audiobooks, professional voiceovers, media production
  • Latency: ~800ms first byte
  • Features: Highest quality, natural prosody, expression tags
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Premium audiobook quality narration.",
        "voice": "Peter",
        "model": "aura-max"
    }
)

Rapid Flash

Ultra-fast model for lowest latency applications.

  • Best for: IVR systems, notifications, high-volume processing
  • Latency: ~100ms first byte
  • Limitation: Only 18 languages supported
Rapid Flash Language Support

Only these 18 languages: Danish, German, English (Australia), English (UK), English (India), English (US), Spanish (Spain), Spanish (US), Filipino, French (Canada), French (France), Hindi, Italian, Japanese, Korean, Portuguese (Brazil), Thai, Vietnamese

response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Ultra-fast response.",
        "voice": "Peter",
        "language": "en-US",
        "model": "rapid-flash"
    }
)

Rapid Max

Fast model with better quality than Rapid Flash.

  • Best for: Balance of speed and quality
  • Latency: ~150ms first byte
  • Features: All 50+ languages, good quality
response = requests.post(
    "https://yourvoic.com/api/v1/tts/generate",
    headers={"X-API-Key": "your_api_key"},
    json={
        "text": "Fast with better quality.",
        "voice": "Peter",
        "model": "rapid-max"
    }
)

Choosing the Right Model

Use CaseRecommended ModelWhy
Voice Assistantaura-liteFast response, supports expressions
Audiobooksaura-maxPremium quality, natural prosody
E-learningaura-primeClear speech, good quality
IVR/Phone Systemsrapid-flashLowest latency
Notificationsrapid-flashFast, cost-effective
Video Narrationaura-maxProfessional quality
Chatbotsaura-liteReal-time with expressions
Batch Processingrapid-maxGood balance for volume