Describes the capabilities of the STT provider.

Remarks

At present, the framework only supports providers that have a streaming endpoint.

interface STTCapabilities {
    alignedTranscript?: false | "word" | "chunk";
    interimResults: boolean;
    streaming: boolean;
}

Properties

alignedTranscript?: false | "word" | "chunk"

Whether this STT supports aligned transcripts with word/chunk timestamps.

  • 'word': Provider returns word-level timestamps
  • 'chunk': Provider returns chunk-level timestamps (e.g., sentence/phrase boundaries)
  • false: Provider does not support aligned transcripts
interimResults: boolean
streaming: boolean