Use the REST endpoint when you have a complete audio file in hand and want a single HTTP request, no WebSocket to manage. The server reads the full body, transcribes it, and streams newline-delimited JSON (NDJSON) messages back as the body.Documentation Index
Fetch the complete documentation index at: https://docs.gradium.ai/llms.txt
Use this file to discover all available pages before exploring further.
Streaming use case?
For live audio (microphone, telephony) or to react to VAD and flush
events in real time, use the SDK streaming guide instead.
Quickstart
Response format
The response isContent-Type: application/x-ndjson. Each line is a
JSON object with a type field, text and end_text are the ones
you’ll typically care about; error may appear if the pipeline fails
mid-stream. The body closes when transcription is complete.
For the full message schema, request body, query parameters, and error
shapes, see the STT POST Endpoint reference.
Passing json_config
Advanced options (temp, language, padding_bonus, delay_in_frames)
are passed as a JSON-encoded string in the json_config query
parameter. The example below sends a language hint:
Next steps
STT POST API reference
Full request/response schema, query parameters, content types.
Streaming with the SDK
Low-latency, microphone-friendly transcription with VAD and flush.