Prerequisites
- A LoRA adapter model hosted on Hugging Face
- A base model that supports LoRA at DeepInfra (see supported base models in the upload form)
- A Hugging Face token if your LoRA adapter is private
- A DeepInfra account and API key
Deploy a LoRA model
- Go to Dashboard
- Click New Deployment
- Click the LoRA Model tab
- Fill in the form:
- LoRA model name — name used to reference this deployment
- Hugging Face Model Name — path to your LoRA adapter on Hugging Face
- Hugging Face Token — optional, required for private repos
Example
Using the public adapteraskardeepinfra/llama-3.1-8B-rank-32-example-lora (base: meta-llama/Meta-Llama-3.1-8B-Instruct):
- Go to Dashboard → New Deployment → LoRA Model
- Fill in:
- LoRA model name:
asdf/lora-example - Hugging Face Model Name:
askardeepinfra/llama-3.1-8B-rank-32-example-lora
- LoRA model name:
- Click Upload
Initializing → Deploying → Running.
Once running, your model page is at https://deepinfra.com/asdf/lora-example.
Inference
model field — the same as any other model.