Chat Completions
The Chat Completions API generates responses based on a conversation history. It supports multi-turn conversations with system, user, and assistant messages.Basic Usage
Message Roles
| Role | Description |
|---|---|
system | Sets the behavior and context for the assistant |
user | Represents the human user’s input |
assistant | Represents previous assistant responses |
Multi-turn Conversation
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
model | str | Required | Model identifier (e.g., gpt-4o-mini) |
messages | list | Required | List of message objects |
max_tokens | int | Model default | Maximum tokens to generate |
temperature | float | 1.0 | Randomness (0.0 to 2.0) |
top_p | float | 1.0 | Nucleus sampling parameter |
stop | list[str] | None | Stop sequences |
fallbacks | list[str] | None | Fallback models |
Example with Parameters
Response Structure
Using ChatMessage Objects
You can also useChatMessage objects instead of dictionaries:
Force Specific Provider
Use theprovider/model syntax to force a specific provider:
Available Models
Common models include:| Model | Provider | Description |
|---|---|---|
gpt-4o-mini | OpenAI | Fast, efficient model |
gpt-4o | OpenAI | Most capable GPT-4 |
claude-3-haiku | Anthropic | Fast Claude model |
claude-3-5-sonnet | Anthropic | Balanced Claude model |
llama-3.1-8b | Various | Open-source Llama |