Voice AI quickstart

Build a simple voice assistant with Python in less than 10 minutes.

Overview

This guide walks you through the setup of your very first voice assistant using LiveKit Agents for Python. In less than 10 minutes, you'll have a voice assistant that you can speak to in your terminal, browser, telephone, or native app.

Requirements

The following sections describe the minimum requirements to get started with LiveKit Agents.

Python

LiveKit Agents requires Python 3.9 or later.
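To confirm that your interpreter meets this requirement, you can run:

python3 --version   # should report 3.9 or later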

Looking for Node.js?

The Node.js beta is still in development and has not yet reached v1.0. See the v0.x documentation for Node.js reference and join the LiveKit Community Slack to be the first to know when the next release is available.

LiveKit server

You need a LiveKit server instance to transport realtime media between user and agent. The easiest way to get started is with a free LiveKit Cloud account. Create a project and use the API keys in the following steps. You may also self-host LiveKit if you prefer.

AI providers

LiveKit Agents integrates with most AI model providers and supports both high-performance STT-LLM-TTS voice pipelines, as well as lifelike multimodal models.

The rest of this guide assumes you use the following starter pack, which provides the best combination of value, features, and ease of setup.

Your agent strings together three specialized providers into a high-performance voice pipeline. You need accounts and API keys for each.

Diagram showing STT-LLM-TTS pipeline.
Component | Provider | Required Key     | Alternatives
STT       | Deepgram | DEEPGRAM_API_KEY | STT integrations
LLM       | OpenAI   | OPENAI_API_KEY   | LLM integrations
TTS       | Cartesia | CARTESIA_API_KEY | TTS integrations

Setup

Use the instructions in the following sections to set up your new project.

Packages

Noise cancellation

This example integrates LiveKit Cloud enhanced background voice/noise cancellation, powered by Krisp.

The noise cancellation plugin is currently unavailable on Windows.

If you're not using LiveKit Cloud, omit the plugin and the noise_cancellation parameter from the following code.
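In that case, the session start call reduces to a sketch like this (the full version appears in the Agent code section below):

await session.start(
    room=ctx.room,
    agent=Assistant(),
)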

Install the following packages to build a complete voice AI agent with your STT-LLM-TTS pipeline, noise cancellation, and turn detection:

pip install \
  "livekit-agents[deepgram,openai,cartesia,silero,turn-detector]~=1.0" \
  "livekit-plugins-noise-cancellation~=0.2" \
  "python-dotenv"

Environment variables

Create a file named .env and add your LiveKit credentials along with the necessary API keys for your AI providers.

DEEPGRAM_API_KEY=<Your Deepgram API Key>
OPENAI_API_KEY=<Your OpenAI API Key>
CARTESIA_API_KEY=<Your Cartesia API Key>
LIVEKIT_API_KEY=<your API Key>
LIVEKIT_API_SECRET=<your API Secret>
LIVEKIT_URL=<your LiveKit server URL>
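The agent code below loads this file with python-dotenv, which makes each key available through the process environment. As a quick sanity check, assuming .env sits in your working directory, you can run a snippet like:

from dotenv import load_dotenv
import os

load_dotenv()                      # reads .env into os.environ
print(os.environ["LIVEKIT_URL"])   # should print your LiveKit server URL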

Agent code

Create a file named main.py containing the following code for your first voice agent.

from dotenv import load_dotenv

from livekit import agents
from livekit.agents import AgentSession, Agent, RoomInputOptions
from livekit.plugins import (
    openai,
    cartesia,
    deepgram,
    noise_cancellation,
    silero,
)
from livekit.plugins.turn_detector.multilingual import MultilingualModel

load_dotenv()


class Assistant(Agent):
    def __init__(self) -> None:
        super().__init__(instructions="You are a helpful voice AI assistant.")


async def entrypoint(ctx: agents.JobContext):
    await ctx.connect()

    # Assemble the STT-LLM-TTS pipeline, plus VAD and turn detection.
    session = AgentSession(
        stt=deepgram.STT(model="nova-3", language="multi"),
        llm=openai.LLM(model="gpt-4o-mini"),
        tts=cartesia.TTS(),
        vad=silero.VAD.load(),
        turn_detection=MultilingualModel(),
    )

    # Start the session in the room, with LiveKit Cloud noise cancellation.
    await session.start(
        room=ctx.room,
        agent=Assistant(),
        room_input_options=RoomInputOptions(
            noise_cancellation=noise_cancellation.BVC(),
        ),
    )

    # Have the agent speak first.
    await session.generate_reply(
        instructions="Greet the user and offer your assistance."
    )


if __name__ == "__main__":
    agents.cli.run_app(agents.WorkerOptions(entrypoint_fnc=entrypoint))

Download model files

To use the turn-detector, silero, or noise-cancellation plugins, you first need to download the model files:

python main.py download-files

Speak to your agent

Start your agent in console mode to run inside your terminal:

python main.py console

Your agent speaks to you in the terminal, and you can speak to it as well.

Screenshot of the CLI console mode.

Connect to playground

Start your agent in dev mode to connect it to LiveKit and make it available from anywhere on the internet:

python main.py dev

Use the Agents playground to speak with your agent and explore its full range of multimodal capabilities.

Congratulations, your agent is up and running. Continue to use the playground or the console mode as you build and test your agent.

Agent CLI modes

In console mode, the agent runs locally and is available only within your terminal.

Run your agent in dev (development/debug) or start (production) mode to connect to LiveKit and join rooms.
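In summary, the three modes map to three commands:

python main.py console   # runs locally; audio in and out of your terminal
python main.py dev       # development mode; connects to LiveKit
python main.py start     # production mode; connects to LiveKit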

Next steps

Follow these guides to bring your voice AI app to life in the real world.

Recipes

A comprehensive collection of examples, guides, and recipes for LiveKit Agents.