https://api.deepinfra.com. It offers two sets of endpoints:
- OpenAI-compatible endpoints at /v1/openai: a drop-in replacement for OpenAI chat completions, embeddings, and image generation. Use these with any OpenAI SDK.
- DeepInfra native endpoints at /v1/inference/{model_name}: direct access to all model types, including speech recognition, object detection, image classification, and more.
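
To make the two endpoint families concrete, here is a minimal sketch of how request URLs could be assembled from the base URL and the two prefixes above. The helper names `openai_url` and `native_url` are hypothetical, the `chat/completions` sub-path is assumed to follow the usual OpenAI convention, and `example-org/example-model` is a placeholder rather than a real model name.

```python
from urllib.parse import quote

BASE_URL = "https://api.deepinfra.com"


def openai_url(path: str) -> str:
    """Build a URL under the OpenAI-compatible prefix /v1/openai.

    `path` is the OpenAI-style sub-path (assumed, e.g. "chat/completions").
    """
    return f"{BASE_URL}/v1/openai/{path.lstrip('/')}"


def native_url(model_name: str) -> str:
    """Build a native inference URL of the form /v1/inference/{model_name}."""
    # Leave "/" unescaped, assuming model names may look like "org/model".
    return f"{BASE_URL}/v1/inference/{quote(model_name, safe='/')}"


if __name__ == "__main__":
    print(openai_url("chat/completions"))
    print(native_url("example-org/example-model"))  # placeholder model name
```

With an OpenAI SDK, the same idea usually reduces to pointing the client's base URL at the /v1/openai prefix instead of building paths by hand.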
Authentication
All endpoints require a Bearer token in the Authorization header: