Text-to-Speech Overview

Gradium exposes Text-to-Speech over two transports. They share the same models and voices; you pick the transport that matches your input shape and latency needs.

WebSocket vs REST

If your text is…	Use	Why
Streamed in token-by-token (LLM output, agent loop, live UI)	WebSocket	Lowest first-byte latency. Send text as it arrives, audio chunks come back as soon as they’re generated.
A finished block in hand (article, scripted line, batch job)	REST	One HTTP call, no connection to manage. Save the response body to a file and you’re done.
Many concurrent synthesis requests	WebSocket + Multiplexing	Run several requests over a single socket, matched by `client_req_id`.

If you’re unsure, start with REST. It’s a single POST and the response body is the audio. Move to WebSocket once latency or streaming-input concerns make REST the bottleneck.

Voices: any voice_id from the Voice Library works on both.
Output formats: PCM, WAV, Opus, and the telephony codecs (ulaw_8000, alaw_8000).
Voice settings: temp, cfg_coef, padding_bonus, rewrite_rules, pronunciation_id. See Voice Settings.
Pronunciation dictionaries: pass pronunciation_id in either transport.

What’s transport-specific

<flush> and <break time="..." /> tags are processed by the model in both transports, but only meaningfully useful when you’re streaming text in over the WebSocket.
WebSocket-only: setup-message stream controls (send_setup_on_start, wait_for_ready_on_start), multiplexing, in-stream flush. See WebSocket Stream Options.
REST-only: only_audio toggle to choose between raw audio bytes and an NDJSON stream that mirrors the WebSocket protocol.

Next steps

Use the WebSocket API

SDK and direct WebSocket usage, streaming output, flush, timestamps.

Use the REST API

One-shot synthesis with a single HTTP POST.

Voice settings

Speed, temperature, voice similarity, rewrite rules.

Voice Library

Browse flagship voices or create your own clones.

Getting Started

Text-to-Speech

Speech-to-Text

Shared

Voices

Resources

Text-to-Speech Overview

WebSocket vs REST

What’s transport-specific

Next steps

Use the WebSocket API

Use the REST API

Voice settings

Voice Library

Getting Started

Text-to-Speech

Speech-to-Text

Shared

Voices

Resources

Documentation Index

​WebSocket vs REST

​What both transports share

​What’s transport-specific

​Next steps

Use the WebSocket API

Use the REST API

Voice settings

Voice Library

WebSocket vs REST

What both transports share

What’s transport-specific

Next steps