STT models accept additional configuration via theDocumentation Index
Fetch the complete documentation index at: https://docs.gradium.ai/llms.txt
Use this file to discover all available pages before exploring further.
json_config
parameter. In the Python SDK, this is a dict mapping option name to
value (float or string). When using the REST endpoint, pass it as a
URL-encoded JSON string in the json_config query parameter.
These options apply to both the WebSocket
and REST transports. For TTS settings,
see Voice Settings.
Quick reference
| Option | Type | Allowed values | Effect |
|---|---|---|---|
temp | float | 0.0–1.0 | Sampling temperature for text generation. 0.0 is greedy; higher values produce more diverse output and can help when no text is being recognised. |
language | string | "en", "fr", "de", "es", "pt" | Expected language of the audio. Grounds the model to a single language for better transcription quality. For mixed audio, set the main language. Any other value raises a server error. |
padding_bonus | float | -4.0–4.0 | Biases the model toward emitting text sooner (negative) or later (positive). Out-of-range values raise a server error. |
delay_in_frames | int | 7, 8, 10, 12, 14, 16, 20, 24, 32, 36, 48 | Audio frames (80 ms each) of context the model gathers before emitting text. Higher values improve quality at the cost of latency. Other values raise a server error. The legacy alias delay_in_tokens is also accepted. |
Passing json_config
The same json_config payload is sent regardless of which SDK API you
use; only the call shape differs:
json_config as a
URL-encoded JSON string in the query parameters, see the
REST guide.