Supported Models
All LLM models and providers supported by AgentRouter
AgentRouter supports models from multiple mainstream LLM providers. Below is the complete list of models and pricing information.
Automatic Routing Rules
AgentRouter automatically routes each request to the corresponding provider based on keywords in the model name:
| Model Name Keyword | Provider | API Format |
|---|---|---|
| gpt- | OpenAI | OpenAI |
| deepseek | DeepSeek | OpenAI |
| moonshot, moon | Moonshot | OpenAI |
| claude- | Anthropic | Anthropic |
| glm- | Zhipu AI | OpenAI |
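The routing table can be read as a simple keyword lookup. The sketch below is illustrative only: `ROUTING_RULES` and `resolve_provider` are hypothetical names, and it assumes keywords are matched case-insensitively as prefixes of the model name, in table order. The actual matching logic inside AgentRouter may differ.

```python
from typing import Optional, Tuple

# (keyword, provider, API format), copied from the routing table above.
# Assumption: a keyword matches when the model name starts with it.
ROUTING_RULES = [
    ("gpt-", "OpenAI", "OpenAI"),
    ("deepseek", "DeepSeek", "OpenAI"),
    ("moonshot", "Moonshot", "OpenAI"),
    ("moon", "Moonshot", "OpenAI"),
    ("claude-", "Anthropic", "Anthropic"),
    ("glm-", "Zhipu AI", "OpenAI"),
]


def resolve_provider(model: str) -> Optional[Tuple[str, str]]:
    """Return (provider, api_format) for a model name, or None if no rule matches."""
    name = model.lower()
    for keyword, provider, api_format in ROUTING_RULES:
        if name.startswith(keyword):
            return provider, api_format
    return None
```

Under these assumptions, a `claude-` model is the only case that needs the Anthropic request format; every other listed keyword resolves to the OpenAI-compatible format.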
OpenAI Models
GPT-5 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| gpt-5 | $30.00/M | $60.00/M | 128K | Latest flagship model |
GPT-4 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| gpt-4o | $2.50/M | $10.00/M | 128K | GPT-4 Omni, multimodal |
| gpt-4o-mini | $0.15/M | $0.60/M | 128K | Economy GPT-4 |
GPT-3.5 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| gpt-3.5-turbo | $0.50/M | $1.50/M | 16K | Classic cost-effective model |
Usage Example
from openai import OpenAI

# Point the OpenAI client at your AgentRouter deployment
client = OpenAI(
    api_key="sk-ar-your-api-key",
    base_url="https://your-agentrouter.com/v1"
)

# Use GPT-4o
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)

Anthropic Claude Models
Claude 4 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| claude-opus-4-20250514 | $15.00/M | $75.00/M | 200K | Strongest reasoning |
| claude-opus-4-20250514-thinking | $15.00/M | $75.00/M | 200K | With thinking chain |
| claude-sonnet-4-20250514 | $3.00/M | $15.00/M | 200K | Balanced performance |
| claude-sonnet-4-20250514-thinking | $3.00/M | $15.00/M | 200K | With thinking chain |
| claude-haiku-4-5-20251001 | $0.80/M | $4.00/M | 200K | Fast response |
| claude-sonnet-4-5-20250929 | $3.00/M | $15.00/M | 200K | Optimized Sonnet |
| claude-opus-4-5-20251101 | $15.00/M | $75.00/M | 200K | Optimized Opus |
Claude 3.7 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| claude-3-7-sonnet-20250219 | $3.00/M | $15.00/M | 200K | Upgraded Sonnet |
| claude-3-7-sonnet-20250219-thinking | $3.00/M | $15.00/M | 200K | With thinking chain |
Claude 3.5 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| claude-3-5-sonnet-20241022 | $3.00/M | $15.00/M | 200K | Recommended |
| claude-3-5-sonnet-20241020 | $3.00/M | $15.00/M | 200K | Stable version |
| claude-3-5-haiku-20241022 | $0.80/M | $4.00/M | 200K | High cost-efficiency |
Claude 3 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| claude-3-opus-20240229 | $15.00/M | $75.00/M | 200K | Flagship model |
| claude-3-sonnet-20240229 | $3.00/M | $15.00/M | 200K | Balanced choice |
| claude-3-haiku-20240307 | $0.25/M | $1.25/M | 200K | Fastest response |
Usage Example
from anthropic import Anthropic

# Point the Anthropic client at your AgentRouter deployment (no /v1 suffix)
client = Anthropic(
    api_key="sk-ar-your-api-key",
    base_url="https://your-agentrouter.com"
)

# Use Claude 3.5 Sonnet
message = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}]
)

DeepSeek Models
DeepSeek V3 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| deepseek-chat | $0.14/M | $0.28/M | 64K | Chat model |
| deepseek-v3.2 | $0.14/M | $0.28/M | 64K | Latest version |
| deepseek-v3.1 | $0.14/M | $0.28/M | 64K | Stable version |
DeepSeek R1 Series
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| deepseek-r1-0528 | $0.55/M | $2.19/M | 64K | Enhanced reasoning |
Usage Example
from openai import OpenAI

# DeepSeek models use the OpenAI-compatible endpoint
client = OpenAI(
    api_key="sk-ar-your-api-key",
    base_url="https://your-agentrouter.com/v1"
)

# Use DeepSeek
response = client.chat.completions.create(
    model="deepseek-chat",
    messages=[{"role": "user", "content": "Hello!"}]
)

Zhipu AI GLM Models
| Model | Input Price | Output Price | Context | Notes |
|---|---|---|---|---|
| glm-4.5 | $0.50/M | $0.50/M | 128K | GLM-4 series |
| glm-4.6 | $0.50/M | $0.50/M | 128K | Latest version |
Usage Example
from openai import OpenAI

# GLM models use the OpenAI-compatible endpoint
client = OpenAI(
    api_key="sk-ar-your-api-key",
    base_url="https://your-agentrouter.com/v1"
)

# Use GLM
response = client.chat.completions.create(
    model="glm-4.5",
    messages=[{"role": "user", "content": "Hello!"}]
)

Moonshot
Moonshot provides Kimi series models, using OpenAI-compatible format:
from openai import OpenAI

# Moonshot models use the OpenAI-compatible endpoint
client = OpenAI(
    api_key="sk-ar-your-api-key",
    base_url="https://your-agentrouter.com/v1"
)

# Use Moonshot
response = client.chat.completions.create(
    model="moonshot-v1-8k",
    messages=[{"role": "user", "content": "Hello!"}]
)

Note: Moonshot model pricing is not listed here. For specific pricing, please contact Moonshot directly.
Pricing Explanation
Pricing Unit
All prices are in USD/M tokens (US Dollars per Million tokens).
Example:
- gpt-4o: Input $2.50/M, Output $10.00/M
- Using 1000 input tokens + 2000 output tokens
- Cost = (1000/1,000,000) × $2.50 + (2000/1,000,000) × $10.00
- Cost = $0.0025 + $0.02 = $0.0225
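The same arithmetic can be written as a small helper. This is a minimal sketch, not AgentRouter's billing code: `PRICES` and `estimate_cost` are hypothetical names, the prices are copied from the tables above, and `Decimal` is used so that small per-request amounts do not pick up floating-point error.

```python
from decimal import Decimal

# (input price, output price) in USD per million tokens, from the tables above.
PRICES = {
    "gpt-4o": (Decimal("2.50"), Decimal("10.00")),
    "gpt-4o-mini": (Decimal("0.15"), Decimal("0.60")),
    "deepseek-chat": (Decimal("0.14"), Decimal("0.28")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    input_price, output_price = PRICES[model]
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_MILLION
```

Calling `estimate_cost("gpt-4o", 1000, 2000)` reproduces the $0.0225 figure from the worked example.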
Billing Method
- Pay-as-you-go - Pay only for actual tokens consumed
- Deducted from wallet - Each request automatically deducts from wallet balance
- Insufficient balance - Returns a 402 Payment Required error
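Client code may want to treat an empty wallet differently from other failures. The sketch below assumes the client library raises an exception that exposes the HTTP status as a `status_code` attribute (the status-error exceptions in the `openai` and `anthropic` Python SDKs do this); `is_insufficient_balance` and `call_with_balance_check` are hypothetical helper names.

```python
from typing import Callable, TypeVar

T = TypeVar("T")

PAYMENT_REQUIRED = 402  # HTTP status returned when the wallet balance is insufficient


def is_insufficient_balance(error: Exception) -> bool:
    """Return True if the error carries HTTP status 402 (Payment Required)."""
    return getattr(error, "status_code", None) == PAYMENT_REQUIRED


def call_with_balance_check(send_request: Callable[[], T]) -> T:
    """Run a request and turn a 402 response into a clearer error message."""
    try:
        return send_request()
    except Exception as error:
        if is_insufficient_balance(error):
            raise RuntimeError(
                "AgentRouter wallet balance is insufficient; top up and retry."
            ) from error
        raise  # Any other failure is passed through unchanged
```

A request would then be wrapped as `call_with_balance_check(lambda: client.chat.completions.create(...))`.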
Cost Inquiry
On the Wallet page you can view:
- Current balance
- Transaction history
- Detailed cost per request
- Consumption statistics by model
Model Selection Guide
By Use Case
Daily Conversation:
- Cost-effective: gpt-3.5-turbo, deepseek-chat, claude-3-haiku
- High quality: gpt-4o, claude-3-5-sonnet
Code Generation:
- Recommended: gpt-4o, claude-3-5-sonnet, deepseek-chat
Long Text Analysis:
- Recommended: claude-3-5-sonnet (200K context)
- Alternative: gpt-4o (128K context)
Complex Reasoning:
- Recommended: gpt-5, claude-opus-4, deepseek-r1
Fast Response:
- Recommended: claude-3-haiku, gpt-4o-mini, deepseek-chat
By Budget
Low Cost (< $1/M):
- deepseek-chat: $0.14/M (input)
- claude-3-haiku: $0.25/M (input)
- gpt-4o-mini: $0.15/M (input)
Medium Cost ($1-5/M):
- gpt-4o: $2.50/M (input)
- claude-3-5-sonnet: $3.00/M (input)
Premium Models (> $10/M):
- gpt-5: $30.00/M (input)
- claude-opus-4: $15.00/M (input)
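The budget tiers above can also be applied programmatically, for example to filter a candidate list before choosing a model. This is an illustrative sketch: `budget_tier` is a hypothetical helper, the thresholds are the ones stated above, and input prices between $5/M and $10/M fall outside every stated tier, so the function returns `None` for them.

```python
from typing import Optional

# Input prices in USD per million tokens, copied from the lists above.
INPUT_PRICES = {
    "deepseek-chat": 0.14,
    "claude-3-haiku": 0.25,
    "gpt-4o-mini": 0.15,
    "gpt-4o": 2.50,
    "claude-3-5-sonnet": 3.00,
    "gpt-5": 30.00,
    "claude-opus-4": 15.00,
}


def budget_tier(input_price: float) -> Optional[str]:
    """Map an input price to the budget tier named in this guide."""
    if input_price < 1:
        return "low"
    if input_price <= 5:
        return "medium"
    if input_price > 10:
        return "premium"
    return None  # Between $5/M and $10/M: no tier is defined in this guide


# Group the listed models by tier.
models_by_tier = {}
for model, price in INPUT_PRICES.items():
    models_by_tier.setdefault(budget_tier(price), []).append(model)
```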
Adding New Models
AgentRouter regularly adds support for new models. If the model you need is not in the list:
- Submit a GitHub Issue
- Contact technical support
- For self-hosted deployments, modify lib/model-pricing.ts to add new models
Next Steps
- Getting Started - Start using these models
- OpenAI Integration Guide - Using OpenAI format models
- Anthropic Integration Guide - Using Anthropic format models
- Wallet & Billing - Learn about billing details