Indicate the end of speech, emitted when the user stops speaking. The first alternative is a combination of all the previous FINAL_TRANSCRIPT events.
Final transcript, emitted when the STT is confident enough that a certain portion of the speech will not change.
Interim transcript, useful for real-time transcription.
Indicate the start of speech. If the STT doesn't support this event, this will be emitted at the same time as the first INTERIM_TRANSCRIPT.
Indicates start/middle/end of speech