Deploy LoRA adapter models on top of base models hosted at DeepInfra. Your adapter is loaded on a supported base model and served with the standard OpenAI-compatible API.

Prerequisites

  1. A LoRA adapter model hosted on Hugging Face
  2. A base model that supports LoRA at DeepInfra (see supported base models in the upload form)
  3. A Hugging Face token if your LoRA adapter is private
  4. A DeepInfra account and API key

Deploy a LoRA model

  1. Go to Dashboard
  2. Click New Deployment
  3. Click the LoRA Model tab
  4. Fill in the form:
    • LoRA model name — name used to reference this deployment
    • Hugging Face Model Name — path to your LoRA adapter on Hugging Face
    • Hugging Face Token — optional, required for private repos

Example

Using the public adapter askardeepinfra/llama-3.1-8B-rank-32-example-lora (base: meta-llama/Meta-Llama-3.1-8B-Instruct):
  1. Go to Dashboard → New Deployment → LoRA Model
  2. Fill in:
    • LoRA model name: asdf/lora-example
    • Hugging Face Model Name: askardeepinfra/llama-3.1-8B-rank-32-example-lora
  3. Click Upload
The deployment appears in Dashboard → Deployments. The deployment moves through the states Initializing → Deploying → Running. Once running, your model page is at https://deepinfra.com/asdf/lora-example.

Inference

curl "https://api.deepinfra.com/v1/openai/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $DEEPINFRA_API_KEY" \
  -d '{
      "model": "asdf/lora-example",
      "messages": [
        {
          "role": "user",
          "content": "Hello!"
        }
      ]
    }'
The LoRA model name goes directly in the model field, the same as any other model hosted at DeepInfra.
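
The curl request above can also be issued from Python. The following is a minimal stdlib-only sketch that builds the same OpenAI-compatible chat-completions request; the helper name `build_request` is illustrative, not part of any DeepInfra SDK, and sending the request (commented out below) requires a valid `DEEPINFRA_API_KEY` and network access:

```python
import json
import os
import urllib.request

API_URL = "https://api.deepinfra.com/v1/openai/chat/completions"

def build_request(model: str, user_message: str) -> urllib.request.Request:
    """Build the same request body the curl example sends.

    `model` is the LoRA model name chosen at deployment time,
    e.g. "asdf/lora-example".
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ.get('DEEPINFRA_API_KEY', '')}",
        },
    )

req = build_request("asdf/lora-example", "Hello!")
print(req.full_url)

# To actually send the request (needs a real API key):
# with urllib.request.urlopen(req) as resp:
#     reply = json.load(resp)
#     print(reply["choices"][0]["message"]["content"])
```

Because the endpoint is OpenAI-compatible, any OpenAI-style client pointed at `https://api.deepinfra.com/v1/openai` should work the same way.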