Optional apiOptional domainOptional endOptional formatOptional keytermsOptional languageOptional maxOptional maxMaximum silence (ms) before end-of-turn is forced regardless of confidence.
Optional minMinimum silence (ms) before a confident end-of-turn is finalized.
Optional promptOnly supported with the u3-rt-pro model.
Optional speakerEnable speaker diarization. Note: AssemblyAI will return per-word speaker
labels, but the JS framework's stt.SpeechData type does not yet expose
a speakerId field (unlike the Python framework), so the labels are not
currently surfaced on emitted events. Setting this to true still has
effect server-side. Once the base SpeechData interface gains speaker
support, #processStreamEvent should forward data.words[].speaker too.
Optional vad
How large each chunk of audio is before being sent to AssemblyAI, in milliseconds. Corresponds to Python's
buffer_size_seconds(seconds there, ms here per this repo's time-unit convention).