Streaming use case?
For live audio (microphone, telephony), or if you need to react to VAD
and flush events in real time, use the SDK streaming guide instead.
Quickstart
Response format
The response has Content-Type: application/x-ndjson. Each line is a
JSON object with a type field; text and end_text are the ones
you’ll typically care about, and error may appear if the pipeline fails
mid-stream. The body closes when transcription is complete.
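The line-by-line format described above can be consumed with a small parser. The following is a minimal sketch, not the official client: it assumes a text message carries its transcript under a "text" key (check the endpoint reference for the real field names), and the function name collect_text_pieces is hypothetical.

```python
import json
from typing import Iterable, List


def collect_text_pieces(lines: Iterable[str]) -> List[str]:
    """Parse NDJSON lines and return the payloads of `text` messages.

    Assumption: a `text` message stores its transcript under a "text" key.
    """
    pieces: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines between messages
        message = json.loads(line)
        kind = message.get("type")
        if kind == "text":
            pieces.append(message.get("text", ""))  # assumed payload key
        elif kind == "error":
            # The pipeline failed mid-stream; surface the whole message.
            raise RuntimeError(f"transcription failed: {message}")
        # Other types (for example end_text) are ignored in this sketch.
    return pieces
```

Any iterable of decoded lines works as input, for example the line iterator of an HTTP client's streaming response, so pieces can be handled as they arrive rather than after the body closes.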
For the full message schema, request body, query parameters, and error
shapes, see the STT POST Endpoint reference.
Passing json_config
Advanced options (temp, language, padding_bonus, delay_in_frames)
are passed as a JSON-encoded string in the json_config query
parameter. The example below sends a language hint:
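A minimal sketch of the encoding step, using only the Python standard library. The base URL https://api.example.com/stt is a placeholder, the helper name build_stt_url is hypothetical, and the value "en" is an assumed format for the language hint; take the real endpoint and accepted values from the API reference.

```python
import json
from urllib.parse import urlencode


def build_stt_url(base_url: str, **options) -> str:
    """Return base_url with `options` JSON-encoded into the json_config query parameter."""
    # Serialize the advanced options to a compact JSON string.
    json_config = json.dumps(options, separators=(",", ":"))
    # urlencode percent-escapes the braces, quotes, and colons for the query string.
    return f"{base_url}?{urlencode({'json_config': json_config})}"


# Placeholder endpoint; "en" is an assumed language-hint value.
url = build_stt_url("https://api.example.com/stt", language="en")
print(url)
```

Building the parameter with json.dumps and urlencode, rather than by hand, avoids malformed JSON and unescaped characters in the query string.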
Next steps
STT POST API reference
Full request/response schema, query parameters, content types.
Streaming with the SDK
Low-latency, microphone-friendly transcription with VAD and flush.