Agents
Speech

Speech

Your agent can accept voice input and reply with audio. Both can be enabled from Model Settings.

Speech Settings

Open "Model Settings" on the Agents page and look for the Speech section.

Speech Settings

Voice Input (Speech to Text)

Enable Speech to Text to let users send voice messages instead of typing.

Speech to Text Settings

How it works:

  1. Open "Model Settings" on the Agents page
  2. Toggle on "Speech to Text"
  3. A microphone icon appears in the chat interface
  4. Users can click the mic, record their message, and send it
  5. Chatolia uses Whisper to transcribe the audio automatically

Speech to Text Mic Settings

No additional configuration is needed. The feature works immediately after enabling.

Voice Output (Text to Speech)

Enable Text to Speech to have the agent's replies played as audio.

Common settings:

  • Autoplay: Automatically start audio playback after each reply

OpenAI

OpenAI provides the gpt-4o-mini-tts model with 4 built-in voices:

  • Alloy
  • Echo
  • Fable
  • Onyx

OpenAI Speech Settings

How to use:

  1. Open "Model Settings"
  2. Toggle on "Text to Speech"
  3. Select "OpenAI" as the speech provider
  4. Choose a voice from the dropdown
  5. (Optional) Enable "Autoplay"

Speech generation uses Chatolia credits.

ElevenLabs

ElevenLabs offers two models:

  • Flash v2.5: Fast, efficient voice synthesis
  • Multilingual v2: Support for multiple languages

ElevenLabs comes with all default voices available for selection.

ElevenLabs Speech Settings

How to use:

  1. Open "Model Settings"
  2. Toggle on "Text to Speech"
  3. Select "ElevenLabs" as the speech provider
  4. Choose a model (Flash v2.5 or Multilingual v2)
  5. Select a voice from the dropdown
  6. (Optional) Enable "Autoplay"

Speech generation uses Chatolia credits unless you provide your own API key.

Use your own ElevenLabs API Key

If you want to use your own ElevenLabs account:

  1. Enable "Use my ElevenLabs key"
  2. Enter your ElevenLabs API key
  3. Click the "Sync" button that appears
  4. Your personal voices will be synced and available for selection

ElevenLabs Speech Settings API Key

Benefits of using your own key:

  • No Chatolia credits are deducted for speech generation
  • Access to your custom ElevenLabs voices
  • Direct billing through your ElevenLabs account

Notes

  • Speech to Text uses Whisper for transcription
  • OpenAI Text to Speech uses Chatolia credits
  • ElevenLabs uses Chatolia credits unless you provide your own API key
  • When using your own ElevenLabs key, speech generation is billed directly to your ElevenLabs account
  • Public agent pages respect your speech settings