Duration of the audio that was recognized in seconds.
Optional
Input audio tokens (for token-based STT billing).
Output text tokens (for token-based STT billing).
Duration of the audio that was recognized in seconds.