# Together AI LLM plugin guide

> How to use Together AI Llama models with LiveKit Agents.

Available in:
- [x] Node.js
- [x] Python

## Overview

This plugin allows you to use [Together AI](https://www.together.ai/) as an LLM provider for your voice agents. Together AI compatibility is provided by the OpenAI plugin using the Together AI Chat Completions API.

## Usage

Install the OpenAI plugin to add Together AI support:

**Python**:

```shell
uv add "livekit-agents[openai]~=1.5"

```

---

**Node.js**:

```shell
pnpm add @livekit/agents-plugin-openai@1.x

```

Set the following environment variable in your `.env` file:

```shell
TOGETHER_API_KEY=<your-together-api-key>

```
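A missing key usually surfaces only when the first request is made, so it can help to check for it at startup. The following is a minimal sketch, not part of the plugin: `require_together_api_key` is a hypothetical helper, and it assumes the `.env` file has already been loaded into the process environment (for example, by a tool such as `python-dotenv`).

```python
import os
from typing import Mapping, Optional


def require_together_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the Together AI API key, or raise if it is missing or blank.

    Hypothetical helper for illustration only. `env` defaults to the
    process environment and can be replaced with a plain dict in tests.
    """
    source = os.environ if env is None else env
    key = source.get("TOGETHER_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "TOGETHER_API_KEY is not set. Add it to your .env file "
            "or export it in your shell before starting the agent."
        )
    return key
```

Calling `require_together_api_key()` once before creating the session turns a missing key into an immediate, readable error.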

Create a Together AI LLM using the `with_together` method (`withTogether` in Node.js):

**Python**:

```python
from livekit.agents import AgentSession
from livekit.plugins import openai

session = AgentSession(
    llm=openai.LLM.with_together(
        model="meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo",
    ),
    # ... tts, stt, vad, turn_handling, etc.
)

```

---

**Node.js**:

```typescript
import { voice } from '@livekit/agents';
import * as openai from '@livekit/agents-plugin-openai';

const session = new voice.AgentSession({
    llm: openai.LLM.withTogether({
        model: "meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo",
    }),
    // ... tts, stt, vad, turnHandling, etc.
});

```

## Parameters

This section describes some of the available parameters. For a complete reference of all available parameters, see the plugin reference links in the [Additional resources](#additional-resources) section.

- **`model`** _(str | TogetherChatModels)_ (optional) - Default: `meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo`: Model to use for inference. To learn more, see [supported models](https://docs.together.ai/docs/inference-models).

- **`temperature`** _(float)_ (optional) - Default: `1.0`: Sampling temperature that controls the randomness of the model's output. Higher values make the output more random, while lower values make it more focused and deterministic. Valid values are between `0` and `1`, although the supported range can vary by model.

- **`parallel_tool_calls`** _(bool)_ (optional): Controls whether the model can make multiple tool calls in parallel. Enabling it can improve performance for complex tasks.

- **`tool_choice`** _(ToolChoice | Literal['auto', 'required', 'none'])_ (optional) - Default: `auto`: Controls how the model uses tools. String options are as follows:

  - `'auto'`: Let the model decide.
  - `'required'`: Force tool usage.
  - `'none'`: Disable tool usage.
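
The parameters above can be collected and sanity-checked before they are passed to the plugin. The following is a minimal sketch under stated assumptions: `build_together_llm_kwargs` is a hypothetical helper (not part of the plugin), it covers only the string forms of `tool_choice`, and it enforces the `0` to `1` temperature range listed in this section even though the supported range can vary by model.

```python
from typing import Any, Dict, Optional

# Default model name taken from the parameter list in this guide.
DEFAULT_MODEL = "meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo"
VALID_TOOL_CHOICES = ("auto", "required", "none")


def build_together_llm_kwargs(
    model: str = DEFAULT_MODEL,
    temperature: Optional[float] = None,
    parallel_tool_calls: Optional[bool] = None,
    tool_choice: Optional[str] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: return only the options that were explicitly set.

    Options left as None are omitted so the plugin's own defaults apply.
    """
    kwargs: Dict[str, Any] = {"model": model}
    if temperature is not None:
        # Assumption: use the 0 to 1 range documented in this guide.
        if not 0.0 <= temperature <= 1.0:
            raise ValueError(f"temperature must be between 0 and 1, got {temperature}")
        kwargs["temperature"] = temperature
    if parallel_tool_calls is not None:
        kwargs["parallel_tool_calls"] = parallel_tool_calls
    if tool_choice is not None:
        if tool_choice not in VALID_TOOL_CHOICES:
            raise ValueError(f"tool_choice must be one of {VALID_TOOL_CHOICES}")
        kwargs["tool_choice"] = tool_choice
    return kwargs
```

With the OpenAI plugin installed, the result could then be unpacked into the factory method, for example `openai.LLM.with_together(**build_together_llm_kwargs(temperature=0.3, tool_choice="auto"))`.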

## Additional resources

The following links provide more information about the Together AI LLM integration.

- **[Together AI docs](https://docs.together.ai/intro)**: Together AI API documentation.

- **[Voice AI quickstart](https://docs.livekit.io/agents/start/voice-ai.md)**: Get started with LiveKit Agents and Together AI.

---

This document was rendered at 2026-06-07T11:35:12.994Z.
For the latest version of this document, see [https://docs.livekit.io/agents/models/llm/together.md](https://docs.livekit.io/agents/models/llm/together.md).

To explore all LiveKit documentation, see [llms.txt](https://docs.livekit.io/llms.txt).