Optional api: Sarvam API key. Defaults to $SARVAM_API_KEY.
Optional first: Fine-grained VAD first-turn minimum speech frames (WS only).
Optional flush: Enable flush signal events from server (WS only). Maps to flush_signal query param.
Optional high: Increase VAD sensitivity (WS only). Maps to high_vad_sensitivity query param.
Optional interrupt: Fine-grained VAD interrupt minimum speech frames (WS only).
Optional language: Language code (BCP-47). Default: 'en-IN'. Set to 'unknown' for auto-detection.
Optional min: Fine-grained VAD minimum speech frames (WS only).
Optional mode: Transcription mode (v3 only). Default: 'transcribe'.
Optional model
Optional negative: Fine-grained VAD negative frames count (WS only).
Optional negative: Fine-grained VAD negative frames window (WS only).
Optional negative: Fine-grained VAD negative speech threshold (WS only).
Optional num: Fine-grained VAD initial ignored frames (WS only).
Optional positive: Fine-grained VAD positive speech threshold (WS only).
Optional pre: Fine-grained VAD pre-speech padding frames (WS only).
Optional prompt: Conversation context to boost model accuracy.
Optional start: Fine-grained VAD start speech volume threshold (WS only).
Optional streaming: Whether to use native WebSocket streaming for stream(). Set to false to prefer non-streaming REST recognition (used by Agent via StreamAdapter + VAD). Default: true.
Optional with: Return chunk-level timestamps in the REST response.
Options specific to saaras:v3 (recommended). See the Sarvam STT API docs.
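The documented defaults and the two query-parameter mappings can be illustrated with a minimal sketch. It is not the library's implementation. The interface and field names (`SketchOptions`, `apiKey`, `flushSignal`, `highVadSensitivity`) and the helper functions are hypothetical placeholders, and serializing booleans as `true`/`false` strings is an assumption. Only `$SARVAM_API_KEY`, the defaults `'en-IN'`, `'transcribe'` and `true`, and the `flush_signal` and `high_vad_sensitivity` parameter names come from the descriptions above.

```typescript
// Hypothetical option shape: these field names are illustrative only and are
// not taken from the library's actual type definitions.
interface SketchOptions {
  apiKey?: string;
  language?: string;
  mode?: string;
  streaming?: boolean;
  flushSignal?: boolean;
  highVadSensitivity?: boolean;
}

// Apply the documented defaults. The environment is passed in explicitly so
// the sketch does not depend on a particular runtime.
function resolveOptions(
  opts: SketchOptions,
  env: Record<string, string | undefined>,
): SketchOptions {
  return {
    ...opts,
    apiKey: opts.apiKey ?? env.SARVAM_API_KEY, // Defaults to $SARVAM_API_KEY
    language: opts.language ?? 'en-IN',        // Default: 'en-IN'
    mode: opts.mode ?? 'transcribe',           // Default: 'transcribe'
    streaming: opts.streaming ?? true,         // Default: true
  };
}

// Build the WS-only query string. Only the two parameter names named in the
// reference are used; encoding booleans as "true"/"false" is an assumption.
function buildWsQuery(opts: SketchOptions): string {
  const params = new URLSearchParams();
  if (opts.flushSignal !== undefined) {
    params.set('flush_signal', String(opts.flushSignal));
  }
  if (opts.highVadSensitivity !== undefined) {
    params.set('high_vad_sensitivity', String(opts.highVadSensitivity));
  }
  return params.toString();
}
```

Under these assumptions, options left unset fall back to the documented defaults, and a VAD flag that is never set is omitted from the query string rather than sent as `false`.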