Skip to main content

Building AI agents

Build AI agents that interact with users through realtime media and data streams.

Overview

Build AI agents that join LiveKit rooms as participants, process realtime media and data streams, and interact with users through voice, text, and vision. The LiveKit Agents framework provides everything you need to build production-ready voice AI agents and programmatic participants.

When you build agents with the Agents framework, they join rooms as participants just like users from frontend apps. Agents can process audio, video, and data streams in realtime, making them ideal for voice assistants, multimodal AI applications, and custom programmatic participants.

The framework allows you to add Python or Node.js programs to any LiveKit room as full realtime participants. It includes tools and abstractions that make it easy to feed realtime media and data through an AI pipeline that works with any provider, and to publish realtime results back to the room.

Getting started

Build your first agent with these resources:

Learn more

For complete documentation on building agents: