Options specific to bulbul:v3

interface TTSV3Options {
    apiKey?: string;
    baseURL?: string;
    model: "bulbul:v3";
    pace?: number;
    sampleRate?: number;
    sentenceTokenizer?: tokenize.SentenceTokenizer;
    speaker?: string;
    streaming?: boolean;
    targetLanguageCode?: string;
    temperature?: number;
}
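
As a rough illustration of how these fields fit together, the sketch below mirrors the interface locally and overlays caller-supplied options on the defaults stated in the property descriptions. `TTSV3OptionsSketch`, `documentedDefaults`, and `withDocumentedDefaults` are hypothetical names introduced here, not part of the plugin; `sentenceTokenizer` is left out because its type comes from the agents framework, and `"hi-IN"` is only an example language tag.

```typescript
// Local mirror of the documented shape, for illustration only.
// sentenceTokenizer is omitted because its type is defined elsewhere.
interface TTSV3OptionsSketch {
  apiKey?: string;
  baseURL?: string;
  model: "bulbul:v3";
  pace?: number;
  sampleRate?: number;
  speaker?: string;
  streaming?: boolean;
  targetLanguageCode?: string;
  temperature?: number;
}

// Defaults as stated in the property descriptions on this page.
const documentedDefaults = {
  pace: 1.0,
  sampleRate: 24000,
  speaker: "shubh",
  streaming: true,
  temperature: 0.6,
};

// Hypothetical helper: caller options take precedence over the defaults.
// Note that a key explicitly set to undefined would override its default.
function withDocumentedDefaults(opts: TTSV3OptionsSketch) {
  return { ...documentedDefaults, ...opts };
}

const resolved = withDocumentedDefaults({
  model: "bulbul:v3",
  targetLanguageCode: "hi-IN", // example value only
  pace: 1.2,
});
```

Only `model` is required; every other field can be left out and falls back to its documented default.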

Hierarchy

  • TTSBaseOptions
    • TTSV3Options

Properties

apiKey?: string

Sarvam API key. Defaults to the SARVAM_API_KEY environment variable.
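
The fallback described above could look roughly like the following. `resolveApiKey` is a hypothetical helper, not the plugin's own code, and the environment is passed in as a plain record so the sketch does not depend on Node typings.

```typescript
// Hypothetical helper, not part of the plugin's API.
// Prefers an explicitly passed key, then falls back to SARVAM_API_KEY.
function resolveApiKey(
  explicit: string | undefined,
  env: Record<string, string | undefined>,
): string {
  const key = explicit ?? env["SARVAM_API_KEY"];
  if (!key) {
    throw new Error("Sarvam API key missing: pass apiKey or set SARVAM_API_KEY");
  }
  return key;
}
```

In a Node process the second argument would typically be `process.env`.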

baseURL?: string

Base URL for the Sarvam API

model: "bulbul:v3"

Model identifier. Required; must be "bulbul:v3" for this options type.

pace?: number

Speech pace. For this model (v3) the valid range is 0.5–2.0; v2 accepts 0.3–3.0. Default: 1.0.
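
Because the allowed range depends on the model version, a caller may want to check the value before constructing the options. The sketch below encodes the two ranges quoted above; `isPaceInRange` and `PACE_RANGES` are hypothetical names, and the `"bulbul:v2"` identifier is an assumption based on the naming of `"bulbul:v3"`.

```typescript
// "bulbul:v2" is an assumed identifier for the v2 model.
type BulbulModel = "bulbul:v2" | "bulbul:v3";

// Inclusive [min, max] pace ranges as quoted in the description.
const PACE_RANGES: Record<BulbulModel, [number, number]> = {
  "bulbul:v2": [0.3, 3.0],
  "bulbul:v3": [0.5, 2.0],
};

// Hypothetical helper: true when pace lies within the model's range.
function isPaceInRange(model: BulbulModel, pace: number): boolean {
  const [min, max] = PACE_RANGES[model];
  return pace >= min && pace <= max;
}
```

For example, a pace of 2.5 passes the v2 range but falls outside the v3 range.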

sampleRate?: number

Output sample rate in Hz (default 24000)

sentenceTokenizer?: tokenize.SentenceTokenizer

Sentence tokenizer for streaming (default: basic sentence tokenizer)

speaker?: string

Speaker voice (v3 voices). Default: 'shubh'

streaming?: boolean

Whether stream() uses native WebSocket streaming. Set to false to prefer non-streaming REST synthesis (used by Agent via the TTS StreamAdapter). Default: true.
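
The effect of the flag can be pictured as a simple two-way choice. This is only a sketch of the documented behavior; `chooseTransport` and `Transport` are hypothetical, and how the plugin wires this up internally is not shown on this page.

```typescript
type Transport = "websocket" | "rest";

// Hypothetical helper mirroring the documented default of true:
// only an explicit `false` selects REST synthesis.
function chooseTransport(opts: { streaming?: boolean }): Transport {
  return opts.streaming === false ? "rest" : "websocket";
}
```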

targetLanguageCode?: string

Target language code (BCP-47)

temperature?: number

Temperature for voice variation. Range: 0.01–2.0. Default: 0.6. Supported by v3 only.
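
One way to keep a user-supplied temperature inside the documented range is to clamp it before passing it on. The helper below is a hypothetical sketch; whether the plugin itself clamps or rejects out-of-range values is not stated here.

```typescript
// Bounds and default taken from the description above.
const TEMPERATURE_MIN = 0.01;
const TEMPERATURE_MAX = 2.0;
const TEMPERATURE_DEFAULT = 0.6;

// Hypothetical helper: returns the default when no usable value is given,
// otherwise clamps the value into [TEMPERATURE_MIN, TEMPERATURE_MAX].
function clampTemperature(value?: number): number {
  if (value === undefined || Number.isNaN(value)) {
    return TEMPERATURE_DEFAULT;
  }
  return Math.min(TEMPERATURE_MAX, Math.max(TEMPERATURE_MIN, value));
}
```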