Integration guides for LiveKit Agents

The LiveKit Agents framework supports integrations with LLM, STT, TTS, and realtime API providers to enable developers to build intelligent, multimodal AI agents.

The Agents framework integrates with providers through plugins. This topic describes LLM, STT, and TTS provider plugins. Additional plugins include support for Retrieval-Augmented Generation (RAG), Natural Language Toolkit (NLTK), LlamaIndex, Silero VAD, turn detection, and more. To see the list of additional plugins and learn more about how LiveKit plugins work, see Working with plugins.

If you want to use a provider not listed in the following sections, contributions for plugins are always welcome. To learn more, see the guidelines for contributions to the Python repository or the Node.js repository.

Realtime API integrations

Realtime APIs are designed for ultra-low-latency AI responses and use multimodal models. They can be better at understanding a user and their emotions, resulting in more natural interactions.

LLM integrations

You can create an instance of an LLM to use in a `VoicePipelineAgent` for the following providers using a plugin:

STT integrations

LiveKit plugins support the following STT providers. Create an instance of an STT to use in a `VoicePipelineAgent` or as a standalone transcription service.

| Provider | Supported frameworks |
| --- | --- |
| AssemblyAI | Python |
| Azure | Python, Node.js |
| AWS | Python |
| Clova | Python |
| Deepgram | Python, Node.js |
| fal | Python |
| Google | Python |
| Groq | Python, Node.js |
| OpenAI | Python, Node.js |
| Speechmatics | Python |
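
To illustrate how an STT instance sits alongside an LLM and a TTS in a voice pipeline, here is a minimal, library-free sketch. Every name in it (`ToyVoicePipeline`, `EchoSTT`, `UppercaseLLM`, `BytesTTS`, and the `transcribe`, `reply`, and `synthesize` methods) is a hypothetical stand-in invented for this example. It is not the LiveKit API, and the real `VoicePipelineAgent` and plugin classes may be shaped quite differently. It only shows the general idea of passing interchangeable provider instances into a pipeline.

```python
from dataclasses import dataclass
from typing import Protocol


# Hypothetical interfaces for this sketch only; not LiveKit's real classes.
class STT(Protocol):
    def transcribe(self, audio: bytes) -> str: ...


class LLM(Protocol):
    def reply(self, text: str) -> str: ...


class TTS(Protocol):
    def synthesize(self, text: str) -> bytes: ...


# Toy implementations. A real plugin would call a provider's service instead.
class EchoSTT:
    def transcribe(self, audio: bytes) -> str:
        # Pretend the "audio" is UTF-8 text so the example stays offline.
        return audio.decode("utf-8")


class UppercaseLLM:
    def reply(self, text: str) -> str:
        # Stand-in for a model response: upper-case the transcript.
        return text.upper()


class BytesTTS:
    def synthesize(self, text: str) -> bytes:
        # Stand-in for synthesized speech: encode the text as bytes.
        return text.encode("utf-8")


@dataclass
class ToyVoicePipeline:
    stt: STT
    llm: LLM
    tts: TTS

    def handle(self, audio: bytes) -> bytes:
        # Speech in -> transcript -> model reply -> speech out.
        transcript = self.stt.transcribe(audio)
        answer = self.llm.reply(transcript)
        return self.tts.synthesize(answer)


pipeline = ToyVoicePipeline(stt=EchoSTT(), llm=UppercaseLLM(), tts=BytesTTS())
result = pipeline.handle(b"hello agent")
print(result)
```

Because the pipeline depends only on the three small interfaces, swapping one provider for another means changing a single constructor argument. That is the property the plugin model is meant to provide.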

TTS integrations

LiveKit plugins support the following TTS providers. Create an instance of a TTS to use in a `VoicePipelineAgent` or for speech generation in your apps.

| Provider | Supported frameworks |
| --- | --- |
| Azure | Python |
| AWS | Python |
| Cartesia | Python, Node.js |
| Deepgram | Python, Node.js |
| ElevenLabs | Python, Node.js |
| Neuphonic | Python |
| OpenAI | Python, Node.js |
| PlayHT | Python |
| Rime | Python |
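
When choosing providers, it can help to check which ones offer both STT and TTS plugins for your framework. The sketch below transcribes the two support tables on this page into dictionaries and intersects them. The names `STT_SUPPORT`, `TTS_SUPPORT`, and `providers_with_both` are made up for this example and are not part of any LiveKit package. The data reflects only the tables above, so it may lag behind the actual plugin list.

```python
# Support data copied from the STT and TTS tables on this page.
STT_SUPPORT = {
    "AssemblyAI": {"Python"},
    "Azure": {"Python", "Node.js"},
    "AWS": {"Python"},
    "Clova": {"Python"},
    "Deepgram": {"Python", "Node.js"},
    "fal": {"Python"},
    "Google": {"Python"},
    "Groq": {"Python", "Node.js"},
    "OpenAI": {"Python", "Node.js"},
    "Speechmatics": {"Python"},
}

TTS_SUPPORT = {
    "Azure": {"Python"},
    "AWS": {"Python"},
    "Cartesia": {"Python", "Node.js"},
    "Deepgram": {"Python", "Node.js"},
    "ElevenLabs": {"Python", "Node.js"},
    "Neuphonic": {"Python"},
    "OpenAI": {"Python", "Node.js"},
    "PlayHT": {"Python"},
    "Rime": {"Python"},
}


def providers_with_both(framework: str) -> list[str]:
    """Return providers listed for both STT and TTS on the given framework."""
    return sorted(
        name
        for name, frameworks in STT_SUPPORT.items()
        if framework in frameworks and framework in TTS_SUPPORT.get(name, set())
    )


print(providers_with_both("Node.js"))  # ['Deepgram', 'OpenAI']
print(providers_with_both("Python"))   # ['AWS', 'Azure', 'Deepgram', 'OpenAI']
```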
