Overview
Agents are composable within a single session to model complex tasks with a high degree of reliability. Some specific scenarios where this is useful include:
- Acquiring recording consent at the beginning of a call.
- Collecting specific structured information, such as an address or a credit card number.
- Moving through a series of questions, one at a time.
- Leaving a voicemail message when the user is unavailable.
- Including multiple personas with unique traits within a single session.
Defining a custom agent
Extend the Agent
class to define a custom agent.
from livekit.agents.voice import Agentclass HelpfulAssistant(Agent):def __init__(self):super().__init__(instructions="You are a helpful voice AI assistant.")async def on_enter(self) -> None:await self.session.say("Hello, how can I help you today?")
Handing off control to another agent
Return a different agent from within a tool call to hand off control. This allows the LLM to make decisions about when handoff should occur.
from livekit.agents.voice import Agentfrom livekit.agents.llm import function_toolclass ConsentCollector(Agent):def __init__(self):super().__init__(instructions="""Your are a voice AI agent with the singular task to collect positiverecording consent from the user. If consent is not given, you must end the call.""")async def on_enter(self) -> None:await self.session.say("May I record this call for quality assurance purposes?")@function_tool()async def on_consent_given(self):"""Use this tool to indicate that consent has been given and the call may proceed."""# Perform a handoff, immediately transfering control to the new agentreturn HelpfulAssistant()@function_tool()async def end_call(self) -> None:"""Use this tool to indicate that consent has not been given and the call should end."""await self.session.say("Thank you for your time, have a wonderful day.")await self.session.aclose()
Context preservation
By default, each new agent starts with a fresh conversation history for their LLM prompt. To include the prior conversation, set the chat_ctx
parameter in the Agent
constructor. You can either copy the prior agent's chat_ctx
, or construct a new one based on custom business logic to provide the appropriate context.
from livekit.agents.llm import ChatContext, function_toolfrom livekit.agents.voice import Agentclass HelpfulAssistant(Agent):def __init__(self, chat_ctx: ChatContext):super().__init__(instructions="You are a helpful voice AI assistant.",chat_ctx=chat_ctx)class ConsentCollector(Agent):# ...@function_tool()async def on_consent_given(self):"""Use this tool to indicate that consent has been given and the call may proceed."""# Pass the chat context during handoffreturn HelpfulAssistant(chat_ctx=self.session.chat_ctx)
The complete conversation history for the session is always available in session.history
.
Passing state
To store custom state within your session, use the userdata
attribute. The type of userdata is up to you, but the recommended approach is to use a dataclass
.
from livekit.agents.voice import AgentSessionfrom dataclasses import dataclass@dataclassclass MySessionInfo:user_name: str | None = Noneage: int | None = None
To add userdata to your session, pass it in the constructor. You must also specify the type of userdata on the AgentSession
itself.
session = AgentSession[MySessionInfo](userdata=MySessionInfo(),# ... tts, stt, llm, etc.)
Userdata is available as session.userdata
, and is also available within function tools on the RunContext
. The following example shows how to use userdata in an agent workflow that starts with the IntakeAgent
.
class IntakeAgent(Agent):def __init__(self):super().__init__(instructions="""Your are an intake agent. Learn the user's name and age.""")@function_tool()async def record_name(self, context: RunContext[MySessionInfo], name: str):"""Use this tool to record the user's name."""context.userdata.user_name = namereturn self._handoff_if_done()@function_tool()async def record_age(self, context: RunContext[MySessionInfo], age: int):"""Use this tool to record the user's age."""context.userdata.age = agereturn self._handoff_if_done()def _handoff_if_done(self):if self.session.userdata.user_name and self.session.userdata.age:return HelpfulAssistant()else:return Noneclass HelpfulAssistant(Agent):def __init__(self):super().__init__(instructions="You are a helpful voice AI assistant.")async def on_enter(self) -> None:userdata: MySessionInfo = self.session.userdataawait self.session.generate_reply(instructions=f"Greet {userdata.user_name} and tell them a joke about being {userdata.age} years old.")
Overriding plugins
You can override any of the plugins used in the session by setting the corresponding attributes in your Agent
constructor. For instance, you can change the voice for a specific agent by overriding the tts
attribute:
from livekit.agents.voice import Agentfrom livekit.plugins import cartesiaclass AssistantManager(Agent):def __init__(self):super().__init__(instructions="You are manager behind a team of helpful voice assistants.",tts=cartesia.TTS(voice="6f84f4b8-58a2-430c-8c79-688dad597532"))
Examples
A complete example is available on GitHub
Storyteller agent
An example with multiple phases of conversation.
Restaurant agent
A complex workflow with order details, shared cart, and checkout.
Further reading
For more information on concepts touched on in this article, see the following related articles: