Interface SpeechData

SpeechData contains metadata about this SpeechEvent.

interface SpeechData {
    confidence: number;
    endTime: number;
    language: LanguageCode;
    metadata?: Record<string, unknown>;
    sourceLanguages?: LanguageCode[];
    sourceTexts?: string[];
    speakerId?: null | string;
    startTime: number;
    targetLanguages?: LanguageCode[];
    targetTexts?: string[];
    text: string;
    words?: TimedString[];
}

Index

Properties

confidence endTime language metadata? sourceLanguages? sourceTexts? speakerId? startTime targetLanguages? targetTexts? text words?

Properties

confidence

confidence: number

Confidence score of the transcription (0-1).

endTime

endTime: number

End time of the speech segment in seconds.

language

language: LanguageCode

Language code of the speech.

`Optional` metadata

metadata?: Record<string, unknown>

Optional plugin-specific metadata (e.g. voice profile, provider diagnostics).

Plugins may populate this with provider-specific data that doesn't map to standard fields.

`Optional` sourceLanguages

sourceLanguages?: LanguageCode[]

The source languages spoken by the user.

Populated by STT services that support translation, where language holds the target language and sourceLanguages holds the original spoken language(s), or by multi-language detection services where language holds the dominant language and sourceLanguages holds all detected languages sorted by prevalence.

May contain multiple entries when a single utterance spans multiple source languages.

`Optional` sourceTexts

sourceTexts?: string[]

The original transcription segments in the source language(s), when translation is active. Each entry corresponds to the same-indexed entry in sourceLanguages.

`Optional` speakerId

speakerId?: null | string

Speaker identifier when the provider supports diarization.

startTime

startTime: number

Start time of the speech segment in seconds.

`Optional` targetLanguages

targetLanguages?: LanguageCode[]

The target language(s) produced by a translation-capable STT service, one entry per consecutive same-language run, parallel to targetTexts.

language holds the dominant or first target language and targetLanguages carries the fine-grained per-run breakdown. Populated when translation is active.

`Optional` targetTexts

targetTexts?: string[]

The translated transcription segments in the target language(s). Each entry corresponds to the same-indexed entry in targetLanguages.

text

text: string

Transcribed text.

`Optional` words

words?: TimedString[]

Word-level timing information.

Interface SpeechData

Index

Properties

Properties

confidence

endTime

language

`Optional` metadata

`Optional` sourceLanguages

`Optional` sourceTexts

`Optional` speakerId

startTime

`Optional` targetLanguages

`Optional` targetTexts

text

`Optional` words

Settings

Member Visibility

Theme

On This Page

Interface SpeechData

Index

Properties

Properties

confidence

endTime

language

Optional metadata

Optional sourceLanguages

Optional sourceTexts

Optional speakerId

startTime

Optional targetLanguages

Optional targetTexts

text

Optional words

Settings

Member Visibility

Theme

On This Page

`Optional` metadata

`Optional` sourceLanguages

`Optional` sourceTexts

`Optional` speakerId

`Optional` targetLanguages

`Optional` targetTexts

`Optional` words