Images and video

Process images and video input, and send visual output with avatars and byte streams.

Overview

LiveKit Agents supports images and video as both input and output modalities. On the input side, you can add images to your agent's chat context, receive images from the frontend, sample video frames, or enable live video input with a supported realtime model. On the output side, you can send images to the frontend using byte streams or add a virtual avatar for lifelike video output.
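As a rough illustration of the input side, an image received as raw bytes can be encoded as a base64 data URL before it is added to a chat context. This is a minimal sketch using only the Python standard library. The helper name `image_to_data_url` is hypothetical and not part of LiveKit Agents, and it assumes the model provider accepts data URLs as image input.

```python
import base64
import mimetypes


def image_to_data_url(data: bytes, filename: str) -> str:
    """Encode raw image bytes as a data URL.

    Hypothetical helper for illustration; not a LiveKit Agents API.
    """
    # Infer the MIME type from the file extension, e.g. "image/png".
    mime_type, _ = mimetypes.guess_type(filename)
    if mime_type is None or not mime_type.startswith("image/"):
        raise ValueError(f"not a recognized image file: {filename}")

    # Base64-encode the bytes so they can be embedded in a text payload.
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"
```

The resulting string can then be attached to a user message as image content. The exact call for doing so depends on the SDK version and is covered in the Images topic.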

In this section

This page provides an overview of image and video capabilities. The following pages in this section cover each topic in detail:

Topic | Description
Images | Add images to your agent's context, receive images from the frontend, and send images back to users.
Video | Sample video frames, enable live video input, and add virtual avatars for video output.

Additional resources