Skip to main content

Building voice agents

In-depth guide to voice AI with LiveKit Agents.

Overview

Building a great voice AI app requires careful orchestration of multiple components. LiveKit Agents is built on top of the Realtime SDK to provide dedicated abstractions that simplify development while giving you full control over the underlying code.

Voice AI providers

You can choose from a variety of providers for each part of the voice pipeline to fit your needs. The framework supports both high-performance STT-LLM-TTS pipelines and speech-to-speech models. In either case, the framework automatically manages interruptions, transcription forwarding, turn detection, and more.

You may add these components to the AgentSession, where they act as global defaults within the app, or to each individual Agent if needed.

Capabilities

The following guides, in addition to others in this section, cover the core capabilities of the AgentSession and how to leverage them in your app.