Overview
The LLM has access to any tools you add to your agent class. This page covers how to define them, use RunContext, handle speech and interruptions, add tools dynamically, and surface errors.
Tool definition
Add tools to your agent class with the @function_tool decorator.
```python
from typing import Any

from livekit.agents import function_tool, Agent, RunContext


class MyAgent(Agent):
    @function_tool()
    async def lookup_weather(
        self,
        context: RunContext,
        location: str,
    ) -> dict[str, Any]:
        """Look up weather information for a given location.

        Args:
            location: The location to look up weather information for.
        """

        return {"weather": "sunny", "temperature_f": 70}
```
Add tools to your agent class with the llm.tool function. This example uses Zod to make it easy to provide a typed, annotated tool definition.
```typescript
import { voice, llm } from '@livekit/agents';
import { z } from 'zod';

class MyAgent extends voice.Agent {
  constructor() {
    super({
      instructions: 'You are a helpful assistant.',
      tools: {
        lookupWeather: llm.tool({
          description: 'Look up weather information for a given location.',
          parameters: z.object({
            location: z.string().describe('The location to look up weather information for.'),
          }),
          execute: async ({ location }, { ctx }) => {
            return { weather: 'sunny', temperatureF: 70 };
          },
        }),
      },
    });
  }
}
```
You can also define the tool parameters as a JSON schema. For example, the tool in the example above can be defined as follows:
```typescript
parameters: {
  type: 'object',
  properties: {
    location: {
      type: 'string',
      description: 'The location to look up weather information for.',
    },
  },
}
```
A good tool definition is key to reliable tool use from your LLM. Be specific about what the tool does, when it should or should not be used, what the arguments are for, and what type of return value to expect.
Name and description
By default, the tool name is the name of the function, and the description is its docstring. Override this behavior with the name and description arguments to the @function_tool decorator.
Tool IDs
Every tool has a stable id property that uniquely identifies it. For function tools, the ID defaults to the function name or the explicit name parameter:
```python
@function_tool()
async def lookup_weather(context: RunContext, location: str) -> str:
    """Look up weather for a location."""
    return "sunny"

lookup_weather.id  # "lookup_weather"


@function_tool(name="get_weather")
async def my_func(context: RunContext, location: str) -> str:
    return "sunny"

my_func.id  # "get_weather"
```
Tool IDs are used for deduplication when calling update_tools(). If the same ID appears more than once, only the last definition is kept. IDs also enable tracking tool changes in conversation history through AgentConfigUpdate.
Arguments
Tool arguments are copied automatically by name from the function signature. Type hints for arguments are included, if present.
Place additional information about the tool arguments, if needed, in the tool description.
Return value
The tool return value is automatically converted to a string before being sent to the LLM. The LLM generates a new reply or additional tool calls based on the return value. Return None or nothing at all to complete the tool silently without requiring a reply from the LLM.
You can use the return value to initiate a handoff to a different Agent within a workflow. Optionally, you can return a tool result to the LLM as well. The tool call and subsequent LLM reply are completed prior to the handoff.
In Python, return a tuple that includes both the Agent instance and the result. If there is no tool result, you can return the new Agent instance by itself.
In Node.js, return an instance of llm.handoff, which specifies the new Agent instance and the tool's return value, if any.
When a handoff occurs, prompt the LLM to inform the user:
```python
@function_tool()
async def my_tool(context: RunContext):
    return SomeAgent(), "Transferring the user to SomeAgent"
```
```typescript
const myTool = llm.tool({
  description: 'Example tool that hands off to another agent',
  execute: async (_, { ctx }) => {
    return llm.handoff({
      agent: new SomeAgent(),
      returns: 'Transferring the user to SomeAgent',
    });
  },
});
```
Structured output
Some LLMs can return structured JSON payloads that define behavior like TTS style separately from the spoken text.
In this example, the LLM streams a JSON object that has both TTS style directives and a spoken response. The TTS style is applied once per message and the spoken response is stripped out for downstream processing. The example contains two code blocks: one defining the format of the JSON and the parsing logic, and one showing an implementation in an agent workflow.
This example uses a cast for the LLM and TTS instances. It's specifically built to work with OpenAI (or OpenAI-compatible) APIs. Read more in the OpenAI Structured Outputs docs.
See the following example for the full implementation:
Structured Output
Core components: Definition and parsing
This code block has two components: the ResponseEmotion schema definition and the process_structured_output parsing function.
- ResponseEmotion: Defines the structure of the JSON object, with both the TTS style directives (voice_instructions) and the spoken response.
- process_structured_output: Incrementally parses the JSON object, optionally applies a callback for TTS style directives, and only streams the spoken response.
```python
from typing import Annotated, AsyncIterable, Callable, Optional, TypedDict

from pydantic import Field
from pydantic_core import from_json


class ResponseEmotion(TypedDict):
    voice_instructions: Annotated[
        str,
        Field(..., description="Concise TTS directive for tone, emotion, intonation, and speed"),
    ]
    response: str


async def process_structured_output(
    text: AsyncIterable[str],
    callback: Optional[Callable[[ResponseEmotion], None]] = None,
) -> AsyncIterable[str]:
    last_response = ""
    acc_text = ""
    async for chunk in text:
        acc_text += chunk
        try:
            resp: ResponseEmotion = from_json(acc_text, allow_partial="trailing-strings")
        except ValueError:
            continue

        if callback:
            callback(resp)

        if not resp.get("response"):
            continue

        new_delta = resp["response"][len(last_response):]
        if new_delta:
            yield new_delta
        last_response = resp["response"]
```
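The delta-extraction idea can be illustrated in isolation. This sketch replaces the tolerant partial-JSON parser with plain json.loads over successive snapshots of the accumulated output, so incomplete snapshots are simply skipped; the extract_deltas helper is illustrative, not part of the LiveKit API:

```python
import json


def extract_deltas(snapshots):
    """Given successive snapshots of the model's JSON output, yield only
    the new characters of the "response" field."""
    last = ""
    for snap in snapshots:
        try:
            resp = json.loads(snap)
        except ValueError:
            continue  # snapshot is still incomplete JSON
        text = resp.get("response", "")
        delta = text[len(last):]
        if delta:
            yield delta
            last = text


snapshots = [
    '{"voice_instructions": "warm, upbeat"',  # incomplete, skipped
    '{"voice_instructions": "warm, upbeat", "response": "Hello"}',
    '{"voice_instructions": "warm, upbeat", "response": "Hello there!"}',
]
print("".join(extract_deltas(snapshots)))  # -> Hello there!
```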
Agent method implementation
This agent implementation example overrides default behavior with custom logic using the LLM and TTS nodes: llm_node and tts_node.
- llm_node: Casts the LLM instance to the OpenAI type, streams the output using the ResponseEmotion schema, and parses it into structured JSON.
- tts_node: Processes the streamed JSON with a callback that applies the TTS style directives (voice_instructions), then streams the audio from the response.
```python
async def llm_node(
    self, chat_ctx: ChatContext, tools: list[FunctionTool], model_settings: ModelSettings
):
    # not all LLMs support structured output, so we need to cast to the specific LLM type
    llm = cast(openai.LLM, self.llm)
    tool_choice = model_settings.tool_choice if model_settings else NOT_GIVEN
    async with llm.chat(
        chat_ctx=chat_ctx,
        tools=tools,
        tool_choice=tool_choice,
        response_format=ResponseEmotion,
    ) as stream:
        async for chunk in stream:
            yield chunk


async def tts_node(self, text: AsyncIterable[str], model_settings: ModelSettings):
    instruction_updated = False

    def output_processed(resp: ResponseEmotion):
        nonlocal instruction_updated
        if resp.get("voice_instructions") and resp.get("response") and not instruction_updated:
            # when the response isn't empty, we can assume voice_instructions is complete
            # (if the LLM sent the fields in the right order)
            instruction_updated = True
            logger.info(
                f"Applying TTS instructions before generating response audio: "
                f'"{resp["voice_instructions"]}"'
            )

            tts = cast(openai.TTS, self.tts)
            tts.update_options(instructions=resp["voice_instructions"])

    # process_structured_output strips the TTS instructions and only synthesizes the
    # verbal part of the LLM output
    return Agent.default.tts_node(
        self, process_structured_output(text, callback=output_processed), model_settings
    )
```
RunContext
Tools support a special RunContext argument, which provides access to the current session, function_call, speech_handle, and userdata. Consult the documentation on speech and state within workflows for more information about how to use these features.
Using speech in tool calls
You can generate agent speech from within a tool using session.say() or session.generate_reply(). However, when waiting for speech to complete inside a tool call, you must use ctx.wait_for_playout() instead of directly awaiting the speech handle.
```python
@function_tool()
async def process_order(self, context: RunContext, order_id: str):
    """Process an order and notify the user."""

    # Generate speech to inform the user
    self.session.generate_reply(
        instructions=f"Processing order {order_id}. This may take a moment."
    )

    # Wait for speech to complete using context.wait_for_playout()
    # Do NOT await the speech handle directly in tool calls
    await context.wait_for_playout()

    # Now perform the actual order processing
    result = await process_order_internal(order_id)

    # Generate follow-up speech
    self.session.generate_reply(
        instructions=f"Order {order_id} has been processed successfully."
    )
    await context.wait_for_playout()

    return result
```
```typescript
const processOrder = llm.tool({
  description: 'Process an order and notify the user.',
  parameters: z.object({
    orderId: z.string(),
  }),
  execute: async ({ orderId }, { ctx }) => {
    // Generate speech to inform the user
    ctx.session.generateReply({
      instructions: `Processing order ${orderId}. This may take a moment.`,
    });

    // Wait for speech to complete using ctx.waitForPlayout()
    // Do NOT await the speech handle directly in tool calls
    await ctx.waitForPlayout();

    // Now perform the actual order processing
    const result = await processOrderInternal(orderId);

    // Generate follow-up speech
    ctx.session.generateReply({
      instructions: `Order ${orderId} has been processed successfully.`,
    });
    await ctx.waitForPlayout();

    return result;
  },
});
```
You can't directly await generate_reply() or the returned SpeechHandle inside tool calls. You must use ctx.wait_for_playout() to properly coordinate speech playback alongside tool execution.
Outside of tool calls, you can directly await generate_reply() or the speech handle.
Interruptions
By default, tools can be interrupted if the user speaks. When interrupted, the tool is removed from the history and the result, if any, is ignored.
The speech handle has utilities for detecting interruption:
```python
wait_for_result = asyncio.ensure_future(self._a_long_running_task(query))
await run_ctx.speech_handle.wait_if_not_interrupted([wait_for_result])

if run_ctx.speech_handle.interrupted:
    # interruption occurred, you should cancel / clean up your tasks
    wait_for_result.cancel()
    # it doesn't matter what you return, the tool no longer exists from the LLM's perspective
    return None
else:
    # your work finished without interruption
    return wait_for_result.result()
```
If your tool is taking external actions that can't be rolled back, you should instead disable interruptions by calling run_ctx.disallow_interruptions() at the start of your tool to ensure user speech won't interrupt the agent's task.
Long running tools
For best practices on providing feedback to the user during long-running tool calls, see the section on user feedback in the External data and RAG guide.
To play a pre-synthesized hold message (such as "let me check that for you") while a tool executes, see Using cached TTS in a tool call. This avoids TTS latency during tool execution and lets you cancel the message early if the external API returns quickly.
Adding tools dynamically
You can exercise more control over the tools available by setting the tools argument directly.
To share a tool between multiple agents, define it outside of their class and then provide it to each. The RunContext is especially useful for this purpose to access the current session, agent, and state.
Tools passed in the tools argument are available alongside any registered within the class using the @function_tool decorator.
```python
from livekit.agents import function_tool, Agent, RunContext


@function_tool()
async def lookup_user(
    context: RunContext,
    user_id: str,
) -> dict:
    """Look up a user's information by ID."""

    return {"name": "John Doe", "email": "john.doe@example.com"}


class AgentA(Agent):
    def __init__(self):
        super().__init__(
            tools=[lookup_user],
            # ...
        )


class AgentB(Agent):
    def __init__(self):
        super().__init__(
            tools=[lookup_user],
            # ...
        )
```
```typescript
import { voice, llm } from '@livekit/agents';
import { z } from 'zod';

const lookupUser = llm.tool({
  description: "Look up a user's information by ID.",
  parameters: z.object({
    userId: z.string(),
  }),
  execute: async ({ userId }, { ctx }) => {
    return { name: 'John Doe', email: 'john.doe@example.com' };
  },
});

class AgentA extends voice.Agent {
  constructor() {
    super({
      tools: {
        lookupUser,
      },
      // ...
    });
  }
}

class AgentB extends voice.Agent {
  constructor() {
    super({
      tools: {
        lookupUser,
      },
      // ...
    });
  }
}
```
Use agent.update_tools() to update available tools after creating an agent. This replaces all tools, including those registered automatically within the agent class. To reference existing tools before replacement, access the agent.tools property:
```python
# add a tool
await agent.update_tools(agent.tools + [tool_a])

# remove a tool
await agent.update_tools([tool for tool in agent.tools if tool is not tool_a])

# replace all tools
await agent.update_tools([tool_a, tool_b])
```
```typescript
// add a tool
await agent.updateTools({ ...agent.toolCtx, toolA });

// remove a tool
const { toolA, ...rest } = agent.toolCtx;
await agent.updateTools({ ...rest });

// replace all tools
await agent.updateTools({ toolA, toolB });
```
Creating tools programmatically
To create a tool on the fly, use function_tool as a function rather than as a decorator. You must supply a name, description, and callable function. This is useful to compose specific tools based on the same underlying code or load them from external sources such as a database or Model Context Protocol (MCP) server.
In the following example, the app has a single function to set any user profile field but gives the agent one tool per field for improved reliability:
```python
from livekit.agents import function_tool, Agent, RunContext


class Assistant(Agent):
    def _set_profile_field_func_for(self, field: str):
        async def set_value(context: RunContext, value: str):
            # custom logic to set input
            return f"field {field} was set to {value}"

        return set_value

    def __init__(self):
        super().__init__(
            tools=[
                function_tool(
                    self._set_profile_field_func_for("phone"),
                    name="set_phone_number",
                    description="Call this function when user has provided their phone number.",
                ),
                function_tool(
                    self._set_profile_field_func_for("email"),
                    name="set_email",
                    description="Call this function when user has provided their email.",
                ),
                # ... other tools ...
            ],
            # instructions, etc ...
        )
```
```typescript
import { voice, llm } from '@livekit/agents';
import { z } from 'zod';

// Defined at module level so it can be called inside the super() arguments,
// where `this` is not yet available
function createSetProfileFieldTool(field: string) {
  return llm.tool({
    description: `Call this function when user has provided their ${field}.`,
    parameters: z.object({
      value: z.string().describe(`The ${field} value to set`),
    }),
    execute: async ({ value }, { ctx }) => {
      // custom logic to set input
      return `field ${field} was set to ${value}`;
    },
  });
}

class Assistant extends voice.Agent {
  constructor() {
    super({
      tools: {
        setPhoneNumber: createSetProfileFieldTool('phone number'),
        setEmail: createSetProfileFieldTool('email'),
        // ... other tools ...
      },
      // instructions, etc ...
    });
  }
}
```
Creating tools from raw schema
For advanced use cases, you can create tools directly from a raw function calling schema. This is useful when integrating with existing function definitions, loading tools from external sources, or working with schemas that don't map cleanly to Python function signatures.
Use the raw_schema parameter in the @function_tool decorator to provide the full function schema:
```python
from livekit.agents import function_tool, RunContext

raw_schema = {
    "type": "function",
    "name": "get_weather",
    "description": "Get weather for a given location.",
    "parameters": {
        "type": "object",
        "properties": {
            "location": {
                "type": "string",
                "description": "City and country e.g. New York",
            },
        },
        "required": ["location"],
        "additionalProperties": False,
    },
}


@function_tool(raw_schema=raw_schema)
async def get_weather(raw_arguments: dict[str, object], context: RunContext):
    location = raw_arguments["location"]

    # Your implementation here
    return f"The weather of {location} is ..."
```
```typescript
import { voice, llm } from '@livekit/agents';

const rawSchema = {
  type: 'object',
  properties: {
    location: {
      type: 'string',
      description: 'City and country e.g. New York',
    },
  },
  required: ['location'],
  additionalProperties: false,
};

const getWeather = llm.tool({
  description: 'Get weather for a given location.',
  parameters: rawSchema,
  execute: async ({ location }, { ctx }) => {
    // Your implementation here
    return `The weather of ${location} is ...`;
  },
});
```
When using raw schemas, function parameters are passed to your handler as a dictionary named raw_arguments. You can extract values from this dictionary using the parameter names defined in your schema.
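Extraction from raw_arguments is plain dictionary access. This standalone sketch (no LiveKit imports) shows the handler logic in isolation, validating a required field and applying a default for an optional one; in a real tool this body runs inside a function decorated with @function_tool(raw_schema=...):

```python
import asyncio


async def get_weather_handler(raw_arguments: dict[str, object]) -> str:
    """Illustrative handler body for a raw-schema tool."""
    location = raw_arguments.get("location")
    if not isinstance(location, str) or not location:
        raise ValueError("'location' is required and must be a non-empty string")

    # optional field with a default, mirroring a non-required schema property
    units = raw_arguments.get("units", "metric")
    return f"The weather in {location} is sunny ({units})."


result = asyncio.run(get_weather_handler({"location": "New York"}))
print(result)  # -> The weather in New York is sunny (metric).
```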
You can also create tools programmatically using function_tool as a function with raw schemas:
```python
from livekit.agents import Agent, RunContext, function_tool


def create_database_tool(table_name: str, operation: str):
    schema = {
        "type": "function",
        "name": f"{operation}_{table_name}",
        "description": f"Perform {operation} operation on {table_name} table",
        "parameters": {
            "type": "object",
            "properties": {
                "record_id": {
                    "type": "string",
                    "description": f"ID of the record to {operation}",
                },
            },
            "required": ["record_id"],
        },
    }

    async def handler(raw_arguments: dict[str, object], context: RunContext):
        record_id = raw_arguments["record_id"]
        # Perform database operation
        return f"Performed {operation} on {table_name} for record {record_id}"

    return function_tool(handler, raw_schema=schema)


# Create tools dynamically
user_tools = [
    create_database_tool("users", "read"),
    create_database_tool("users", "update"),
    create_database_tool("users", "delete"),
]


class DataAgent(Agent):
    def __init__(self):
        super().__init__(
            instructions="You are a database assistant.",
            tools=user_tools,
        )
```
```typescript
import { voice, llm } from '@livekit/agents';
import { z } from 'zod';

function createDatabaseTool(tableName: string, operation: string) {
  return llm.tool({
    description: `Perform ${operation} operation on ${tableName} table`,
    parameters: z.object({
      recordId: z.string().describe(`ID of the record to ${operation}`),
    }),
    execute: async ({ recordId }, { ctx }) => {
      // Perform database operation
      return `Performed ${operation} on ${tableName} for record ${recordId}`;
    },
  });
}

// Create tools dynamically
const dataAgent = new voice.Agent({
  instructions: 'You are a database assistant.',
  tools: {
    readUsers: createDatabaseTool('users', 'read'),
    updateUsers: createDatabaseTool('users', 'update'),
    deleteUsers: createDatabaseTool('users', 'delete'),
  },
});
```
Toolsets
A Toolset groups related tools under a single ID. Use toolsets to bundle tools that work together as a unit, making it easier to add or remove them as a group.
Create a custom subclass of Toolset to group tools:
```python
from livekit.agents import Agent, RunContext, function_tool
from livekit.agents.llm import Tool, Toolset


class WeatherToolset(Toolset):
    def __init__(self):
        super().__init__(id="weather_tools")
        self._lookup = function_tool(
            self._lookup_weather,
            name="lookup_weather",
            description="Look up current weather for a location.",
        )
        self._forecast = function_tool(
            self._get_forecast,
            name="get_forecast",
            description="Get a multi-day weather forecast.",
        )

    async def _lookup_weather(self, context: RunContext, location: str) -> str:
        return f"The weather in {location} is sunny."

    async def _get_forecast(self, context: RunContext, location: str, days: int = 3) -> str:
        return f"{days}-day forecast for {location}: sunny."

    @property
    def tools(self) -> list[Tool]:
        return [self._lookup, self._forecast]


class MyAgent(Agent):
    def __init__(self):
        super().__init__(
            instructions="You are a helpful weather assistant.",
            tools=[WeatherToolset()],
        )
```
Toolsets are flattened automatically when sent to the LLM. You can add or remove a toolset as a group using update_tools().
Tool names must be unique across all tools and toolsets. If a toolset and a standalone tool, or two toolsets, share a tool with the same name, the agent raises a ValueError.
Tracking configuration changes
When you update an agent's tools or instructions at runtime, an AgentConfigUpdate is automatically added to the conversation history. This record includes:
- instructions: The updated instructions, if changed.
- tools_added: Names of any tools that were added.
- tools_removed: Names of any tools that were removed.
```python
# Tool changes are tracked automatically
await agent.update_tools([new_tool_a, new_tool_b])

# Instruction changes are also tracked
await agent.update_instructions("You are now a support agent.")
```
This gives the LLM visibility into configuration changes, which is useful in multi-agent workflows where agents switch and have different tool sets. For example, after the calls above, the conversation history includes records like:
```python
# agent.chat_ctx.items
[
    ChatMessage(role="user", content=["What's the weather?"]),
    ChatMessage(role="assistant", content=["Let me check..."]),
    AgentConfigUpdate(
        tools_added=["new_tool_a", "new_tool_b"],
        tools_removed=["old_tool"],
    ),
    AgentConfigUpdate(
        instructions="You are now a support agent.",
    ),
]
```
To exclude configuration updates when copying or serializing conversation history, use exclude_config_update:
```python
ctx_copy = chat_ctx.copy(exclude_config_update=True)
```
Error handling
Raise the ToolError exception to return an error to the LLM in place of a response. You can include a custom message to describe the error and/or recovery options.
```python
@function_tool()
async def lookup_weather(
    self,
    context: RunContext,
    location: str,
) -> dict[str, Any]:
    if location == "mars":
        raise ToolError(
            "This location is coming soon. Please join our mailing list to stay updated."
        )
    else:
        return {"weather": "sunny", "temperature_f": 70}
```
```typescript
import { llm } from '@livekit/agents';
import { z } from 'zod';

const lookupWeather = llm.tool({
  description: 'Look up weather information for a location',
  parameters: z.object({
    location: z.string().describe('The location to get weather for'),
  }),
  execute: async ({ location }, { ctx }) => {
    if (location === 'mars') {
      throw new llm.ToolError(
        'This location is coming soon. Please join our mailing list to stay updated.'
      );
    }
    return { weather: 'sunny', temperatureF: 70 };
  },
});
```