Recipes

This page is an index of quickstarts, guides, and short recipes focused on particular use cases.

Quickstart

Voice AI quickstart

Create a voice AI agent in less than 10 minutes.
Agent Workflows, Complete Demo

Medical Office Triage

Agent that triages patients based on symptoms and medical history.
Agent Workflows, Complete Demo

Personal Shopper

AI shopping assistant that helps find products based on user preferences.
Agent Workflows, Complete Demo

Restaurant Agent

A restaurant front-of-house agent that can take orders, add items to a shared cart, and check out.
Realtime Data, Complete Demo

LivePaint

A realtime drawing game where players compete to complete a drawing prompt while being judged by a realtime AI agent.
Voice Processing

Push-to-Talk Agent

A voice AI agent that uses push-to-talk for controlled multi-participant conversations, only enabling audio input when explicitly triggered.
Audio

Background Audio

A voice AI agent with background audio for thinking states and ambiance.
Voice Processing

Uninterruptable Agent

An agent that continues speaking without being interrupted.
Voice Processing

Change Language

Agent that can switch between different languages during conversation.
Audio

TTS Comparison

Compare different text-to-speech providers side by side.
Voice Processing

Transcriber

Real-time speech transcription with high accuracy.
Voice Processing

Keyword Detection

Detect specific keywords in speech in real time.
Telephony

Using Twilio Voice

Use TwiML to accept incoming calls and bridge Twilio conferencing to LiveKit via SIP.
Telephony

IVR Agent

Build a voice agent that can call external voice lines and respond to IVR flows using DTMF tones.
Telephony, Complete Demo

Company Directory

Build an AI company directory agent. The agent can respond to DTMF tones and voice prompts, then redirect callers.
Telephony

Phone Caller

Agent that can make outbound phone calls and handle conversations.
Telephony

SIP Warm Handoff

Transfer calls from an AI agent to a human operator seamlessly.
Telephony

SIP Lifecycle

Complete lifecycle management for SIP calls.
Telephony

Answer Incoming Calls

Set up an agent to answer incoming SIP calls.
Telephony

Survey Caller

Automated survey calling system.
Advanced LLM

Chain of thought agent

Build an agent for chain-of-thought reasoning using the `llm_node` to clean the text before TTS.
RAG

LlamaIndex RAG

A voice AI agent that uses LlamaIndex for RAG to answer questions from a knowledge base.
RAG, Advanced LLM

RAG

An agent that can answer questions using RAG.
Tool Use, Complete Demo

Moviefone

This agent uses function calling and the OpenAI API to search for movies and give you realtime information about showtimes.
Advanced LLM

Context Variables

Maintain conversation context across interactions.
Advanced LLM

Interrupt User

Example of how to implement user interruption in conversations.
Advanced LLM

LLM Content Filter

Implement content filtering in the `llm_node`.
Advanced LLM

Simple Content Filter

Basic content filtering implementation.
Advanced LLM

Replacing LLM Output

Example of modifying LLM output before processing.
Vision

Vision AI agent

A voice AI agent with live video input powered by Gemini.
Robotics

Raspberry Pi Transcriber

Run transcription on Raspberry Pi hardware.
Voice Processing

Pipeline Translator

Implement translation in the processing pipeline.
Voice Processing

TTS Translator

Translation with text-to-speech capabilities.
Advanced LLM, Metrics

LLM Metrics

Track and analyze LLM performance metrics.
Voice Processing, Metrics

STT Metrics

Track and analyze speech-to-text performance metrics.
Audio, Metrics

TTS Metrics

Track and analyze text-to-speech performance metrics.
Voice Processing, Metrics

VAD Metrics

Track and analyze voice activity detection metrics.
Audio

Playing Audio

Play audio files during agent interactions.
Audio

Sound Repeater

Simple sound repeating demo for testing audio pipelines.