Gemini Live API integration guide

How to use the Gemini Live API with LiveKit Agents.

Try the playground

Chat with a voice assistant built with LiveKit and the Gemini Live API

Overview

Google's Gemini Live API enables low-latency, two-way interactions that use text, audio, and video input, with audio and text output. LiveKit's Google plugin includes a RealtimeModel class that allows you to use this API to create agents with natural, human-like voice conversations.

Quick reference

This section includes a basic usage example and some reference material. For links to more detailed documentation, see Additional resources.

Installation

Install the Google plugin from PyPI:

pip install "livekit-agents[google]~=1.0"

Authentication

The Google plugin requires authentication based on your chosen service:

  • For Vertex AI, you must set the GOOGLE_APPLICATION_CREDENTIALS environment variable to the path of the service account key file.
  • For Google Gemini API, set the GOOGLE_API_KEY environment variable.
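As a minimal sketch, an application might check these variables before starting an agent. The helper name `detect_google_auth` is hypothetical and not part of the plugin; only the two environment variable names come from this guide. Checking the Vertex AI variable first is an arbitrary choice made for illustration.

```python
import os
from typing import Mapping, Optional


def detect_google_auth(env: Optional[Mapping[str, str]] = None) -> str:
    """Report which Google service the environment is configured for.

    Hypothetical helper for illustration; not part of the LiveKit Google plugin.
    """
    env = os.environ if env is None else env
    # Assumption: if both variables are set, prefer Vertex AI.
    if env.get("GOOGLE_APPLICATION_CREDENTIALS"):
        return "vertexai"
    if env.get("GOOGLE_API_KEY"):
        return "gemini_api"
    raise RuntimeError(
        "Set GOOGLE_APPLICATION_CREDENTIALS (Vertex AI) "
        "or GOOGLE_API_KEY (Gemini API)."
    )
```

Calling `detect_google_auth()` with no arguments reads from the process environment, so a misconfigured deployment fails at startup rather than on the first request.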

Usage

Use the Gemini Live API within an AgentSession. For example, you can use it in the Voice AI quickstart.

from livekit.agents import AgentSession
from livekit.plugins import google

# The realtime model is passed as the session's llm.
session = AgentSession(
    llm=google.beta.realtime.RealtimeModel(
        model="gemini-2.0-flash-exp",
        voice="Puck",
        temperature=0.8,
        instructions="You are a helpful assistant",
    ),
)
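For Vertex AI, the same constructor takes the `vertexai`, `project`, and `location` parameters described under Parameters. The sketch below shows one way to assemble those keyword arguments from the environment. The helper `vertex_realtime_kwargs` is hypothetical, and leaving out unset values so that the plugin applies its own defaults is an assumption about how you might want to structure this, not documented plugin behavior.

```python
import os
from typing import Any, Dict, Mapping, Optional


def vertex_realtime_kwargs(
    env: Optional[Mapping[str, str]] = None,
) -> Dict[str, Any]:
    """Build keyword arguments for a Vertex AI RealtimeModel.

    Hypothetical helper for illustration; not part of the LiveKit Google plugin.
    """
    env = os.environ if env is None else env
    kwargs: Dict[str, Any] = {"vertexai": True}
    # Only pass values that are explicitly set, so the plugin's
    # documented defaults still apply when they are absent.
    project = env.get("GOOGLE_CLOUD_PROJECT")
    if project:
        kwargs["project"] = project
    location = env.get("GOOGLE_CLOUD_LOCATION")
    if location:
        kwargs["location"] = location
    return kwargs


# Intended use (requires the Google plugin to be installed):
#   google.beta.realtime.RealtimeModel(**vertex_realtime_kwargs())
```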

Parameters

This section describes some of the available parameters. For a complete reference of all available parameters, see the plugin reference.

instructions (string, optional)

System instructions to better control the model's output and specify tone and sentiment of responses. To learn more, see System instructions.

model (LiveAPIModels | string, optional, default: gemini-2.0-flash-exp)

Live API model to use.

api_key (string, optional, env: GOOGLE_API_KEY)

Google Gemini API key.

voice (Voice | string, optional, default: Puck)

Name of the Gemini Live API voice. For a full list, see Voices.

modalities (list[Modality], optional, default: ["AUDIO"])

List of response modalities to use, such as ["TEXT", "AUDIO"].

vertexai (boolean, optional, default: false)

If set to true, use Vertex AI.

project (string, optional, env: GOOGLE_CLOUD_PROJECT)

Google Cloud project ID to use for the API (if vertexai=True). By default, it uses the project in the service account key file (set using the GOOGLE_APPLICATION_CREDENTIALS environment variable).

location (string, optional, env: GOOGLE_CLOUD_LOCATION)

Google Cloud location to use for the API (if vertexai=True). By default, it uses the location from the service account key file or us-central1.

temperature (float, optional)

A measure of randomness of completions. A lower temperature is more deterministic. To learn more, see Temperature.
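To make the effect of temperature concrete, the sketch below applies the standard softmax-with-temperature formulation to a toy set of scores. This guide does not describe how Gemini applies temperature internally, so treat this as a general illustration of the concept; the function name and the example logits are made up for the demonstration.

```python
import math
from typing import List


def softmax_with_temperature(logits: List[float], temperature: float) -> List[float]:
    """Convert raw scores to probabilities, scaled by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the maximum for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]  # illustrative scores for three candidate tokens
cold = softmax_with_temperature(logits, 0.2)
warm = softmax_with_temperature(logits, 0.8)
# At the lower temperature, more probability mass sits on the top-scoring
# candidate, so sampling is closer to deterministic.
```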

Turn detection

The Gemini Live API includes built-in VAD-based turn detection, which is currently the only supported turn detection method.
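For readers unfamiliar with VAD-based turn detection, the toy sketch below shows the general idea: a turn ends once speech has been followed by a sustained run of silence. This is not Gemini's implementation, which runs inside the Live API and is not described here; the function name, the energy threshold, and the frame count are all hypothetical values chosen for illustration.

```python
from typing import Iterable, List


def detect_turn_ends(
    frame_energies: Iterable[float],
    speech_threshold: float = 0.5,
    silence_frames_to_end: int = 3,
) -> List[int]:
    """Return the frame indices at which a speaker's turn is considered over.

    Toy energy-based voice activity detection, for illustration only.
    """
    turn_ends: List[int] = []
    in_speech = False
    silent_run = 0
    for index, energy in enumerate(frame_energies):
        if energy >= speech_threshold:
            # Speech frame: start (or continue) a turn and reset the silence count.
            in_speech = True
            silent_run = 0
        elif in_speech:
            # Silence after speech: end the turn once it lasts long enough.
            silent_run += 1
            if silent_run >= silence_frames_to_end:
                turn_ends.append(index)
                in_speech = False
                silent_run = 0
    return turn_ends
```

A brief pause shorter than `silence_frames_to_end` does not end the turn, which is why such detectors trade responsiveness against the risk of cutting a speaker off mid-sentence.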

Additional resources

The following resources provide more information about using Gemini with LiveKit Agents.