Convert text to speech audio. Returns WAV audio. Supports prosody planning and conversational mode for voice agents. Max text length: 5000 characters.
Bearer token authentication. Supports two token types:
Token from POST /v1/auth/login
Token from POST /v1/api-tokens, format: cmnd_<tokenId>.<secret>
Text to synthesize (max 5000 characters)
Voice ID from /v1/echo/tts/voices
Speech speed multiplier (0.5-2.0)
Language code (en, de, tr)
Enable prosody planning for natural pacing
Enable conversational mode with auto-detected presets
Speech pacing for conversational mode. Allowed values: fast, normal, slow
Response: audio file (WAV). The response is of type file.
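The limits listed above (5000 characters, speed 0.5-2.0, languages en/de/tr, pacing fast/normal/slow) can be checked on the client before a request is sent. The following is a minimal sketch, not the service's official client. The base URL, the endpoint path /v1/echo/tts, the JSON body format, and every field name in the payload are assumptions, because this page does not state them. Only the limits and the Bearer scheme come from the text above.

```python
import json
import urllib.request

# ASSUMPTIONS: the base URL, endpoint path, and JSON field names below are
# hypothetical placeholders; this page does not state them.
BASE_URL = "https://api.example.com"
TTS_PATH = "/v1/echo/tts"

# Limits taken from the parameter descriptions above.
MAX_TEXT_LENGTH = 5000
LANGUAGES = {"en", "de", "tr"}
PACINGS = {"fast", "normal", "slow"}


def build_tts_request(token, text, voice_id, speed=1.0, language="en",
                      prosody=False, conversational=False, pacing="normal"):
    """Check the documented limits and return an unsent POST request."""
    if len(text) > MAX_TEXT_LENGTH:
        raise ValueError("text exceeds 5000 characters")
    if not 0.5 <= speed <= 2.0:
        raise ValueError("speed must be between 0.5 and 2.0")
    if language not in LANGUAGES:
        raise ValueError("language must be one of en, de, tr")
    if pacing not in PACINGS:
        raise ValueError("pacing must be one of fast, normal, slow")

    # Hypothetical field names, chosen only for illustration.
    payload = {
        "text": text,
        "voice": voice_id,
        "speed": speed,
        "language": language,
        "prosody": prosody,
        "conversational": conversational,
        "pacing": pacing,
    }
    return urllib.request.Request(
        BASE_URL + TTS_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

To send the request, pass the returned object to urllib.request.urlopen and write the response bytes to a .wav file. Either token type goes in the same Authorization header.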