Latest total time from audio-frame creation to prediction receive, in milliseconds.
Optional metadata?: MetricsMetadataNumber of prediction requests served (incremental).
Latest time taken by the model side, in milliseconds.
Latest RTT time taken to perform inference, in milliseconds.
Per-prediction telemetry for the audio EOT (end-of-turn) detector. Emitted by transports on each cloud or local prediction so we can track detection latency and inference time per call.