The request duration in milliseconds, 0.0 if the STT is streaming.
Optional. Input audio tokens (for token-based billing).
Optional metadata?: MetricsMetadata. Metadata for model provider and name tracking.
Optional. Output text tokens (for token-based billing).
Whether the STT is streaming (e.g., using a WebSocket).
The duration of the pushed audio in milliseconds.
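The fields described above can be read together as one metrics record. The following is a minimal sketch under stated assumptions, not the library's actual type: apart from `metadata` and `MetricsMetadata`, every identifier here (`SttMetricsSketch`, `durationMs`, `audioDurationMs`, `streamed`, `inputTokens`, `outputTokens`, `modelProvider`, `modelName`) is a hypothetical name chosen for illustration, and the shape of the metadata object is assumed from its description.

```typescript
// Hypothetical shapes: only the field meanings come from the descriptions above.
interface MetricsMetadataSketch {
  modelProvider?: string; // assumed field name
  modelName?: string;     // assumed field name
}

interface SttMetricsSketch {
  durationMs: number;      // request duration in ms; 0.0 if the STT is streaming
  audioDurationMs: number; // duration of the pushed audio in ms
  streamed: boolean;       // whether the STT is streaming
  inputTokens?: number;    // optional input audio tokens (token-based billing)
  outputTokens?: number;   // optional output text tokens (token-based billing)
  metadata?: MetricsMetadataSketch;
}

// Sum the optional token counts, treating a missing count as zero.
function totalTokens(m: SttMetricsSketch): number {
  return (m.inputTokens ?? 0) + (m.outputTokens ?? 0);
}

// Build a one-line summary; the request duration is only meaningful
// for non-streaming requests, so it is omitted when streaming.
function summarize(m: SttMetricsSketch): string {
  const model = m.metadata?.modelName ?? "unknown model";
  const latency = m.streamed ? "streaming" : `${m.durationMs} ms request`;
  return `${model}: ${m.audioDurationMs} ms audio, ${latency}, ${totalTokens(m)} tokens`;
}

const example: SttMetricsSketch = {
  durationMs: 0,
  audioDurationMs: 3200,
  streamed: true,
  inputTokens: 120,
  outputTokens: 42,
  metadata: { modelProvider: "example-provider", modelName: "example-model" },
};

console.log(summarize(example));
```

Treating absent token counts as zero keeps the helper usable with providers that bill by audio duration rather than by tokens.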