DeepInfra exposes an Anthropic-compatible Messages API. This means tools that target the Anthropic API — Claude Code, the Anthropic Python and TypeScript SDKs, and any framework with an Anthropic adapter — can point at DeepInfra and use open-source models.Documentation Index
Fetch the complete documentation index at: https://docs.deepinfra.com/llms.txt
Use this file to discover all available pages before exploring further.
Endpoint
| Endpoint | Description |
|---|---|
POST /anthropic/v1/messages | Create a message (chat completion) |
POST /anthropic/v1/messages/count_tokens | Count tokens for a message request |
Authentication
Both standard Anthropic authentication methods are supported:| Header | Example |
|---|---|
Authorization | Bearer $DEEPINFRA_TOKEN |
x-api-key | $DEEPINFRA_TOKEN |
anthropic-version and anthropic-beta headers as needed.
Using the Anthropic SDK
Using with Claude Code
Claude Code can use DeepInfra as its backend. To keep your normal Claude Code setup untouched, add a dedicated shell function to your~/.bashrc or ~/.zshrc:
deepinfra instead of claude to launch Claude Code via DeepInfra. Your regular claude command stays unchanged.
Model override environment variables
Claude Code uses model aliases (opus, sonnet, haiku) internally. You can remap each alias to a DeepInfra model using these environment variables:
| Environment variable | Description | Example |
|---|---|---|
ANTHROPIC_MODEL | The primary model Claude Code uses for all tasks | deepseek-ai/DeepSeek-V3.1-Terminus |
ANTHROPIC_DEFAULT_OPUS_MODEL | Model used for the opus alias (complex reasoning) | deepseek-ai/DeepSeek-R1 |
ANTHROPIC_DEFAULT_SONNET_MODEL | Model used for the sonnet alias (daily coding) | deepseek-ai/DeepSeek-V3.1-Terminus |
ANTHROPIC_DEFAULT_HAIKU_MODEL | Model used for the haiku alias and background tasks (tab completions, commit messages) | Qwen/Qwen3-30B-A3B |
CLAUDE_CODE_SUBAGENT_MODEL | Model used for subagents (parallel background tasks) | Qwen/Qwen3-30B-A3B |
ANTHROPIC_DEFAULT_HAIKU_MODEL is used for lightweight background tasks like tab completions and commit messages. Pick a fast, cheap model here to keep costs low. The older ANTHROPIC_SMALL_FAST_MODEL variable is deprecated — use ANTHROPIC_DEFAULT_HAIKU_MODEL instead.Streaming
Streaming works the same as the Anthropic API — usestream=True (Python) or stream: true (JS/cURL):
Token counting
Count the tokens in a message request before sending it:Notes
- You are running open-source models via the Anthropic protocol, not Anthropic’s Claude models.
- Model names use DeepInfra identifiers (e.g.
deepseek-ai/DeepSeek-V3), not Anthropic model names. - Not all Anthropic-specific features may be supported. Standard message creation, streaming, and token counting work as expected.
Chat Completions
Use the OpenAI-compatible API instead.
Authentication
API keys and scoped JWTs.