Skip to main content
DeepInfra GPU Clusters give you dedicated access to NVIDIA B200 and B300 hardware — the most powerful GPUs available. You get a container with SSH access and full control over your environment, billed by the hour.

Available GPUs

GPUMemoryBest for
B200-180GB180GB HBM3eLarge-scale inference, fine-tuning, training
B300-288GB288GB HBM3eLargest models, maximum throughput, multi-node training

Key features

  • Dedicated access — no sharing with other users
  • SSH access — connect directly to your container
  • Full environment control — bring your own Docker image or use ours
  • Pay-per-use — billed by the hour, only while running
  • Quick setup — running in minutes

Use cases

  • LLM training and fine-tuning
  • Large-scale batch inference
  • Research and experimentation
  • Development environments with GPU access

Web UI

Start a new container

  1. Go to Dashboard → Instances
  2. Click New Container
  3. Select GPU configuration — choose from available B200/B300 configs. Each shows:
    • GPU type, quantity, and memory (e.g., 1xB200-180GB, 8xB300-288GB)
    • Hourly pricing
    • Availability status
  4. Enter container details:
    • Container Name — a descriptive name
    • SSH Key — paste your public SSH key (format: ssh-rsa AAAAB3NzaC1yc2E...)
  5. Accept the NVIDIA license agreements and cryptocurrency mining policy
  6. Click I agree to the above

Connect to a running container

  1. Wait for container status to show running
  2. Click on the SSH login field to copy the command
  3. Run ssh ubuntu@<ip-address> in your terminal

Stop a container

  1. Click on the container in the instances list
  2. Click Terminate
  3. Type confirm and click Terminate
All container data is permanently lost when terminated. Save your work before stopping.

HTTP API

Create a container

curl -X POST https://api.deepinfra.com/v1/containers \
  -H "Authorization: Bearer $DEEPINFRA_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "my-cluster",
    "gpu_config": "8xB200-180GB",
    "container_image": "di-cont-ubuntu-torch:latest",
    "cloud_init_user_data": "#cloud-config\nusers:\n- name: ubuntu\n  shell: /bin/bash\n  sudo: '\''ALL=(ALL) NOPASSWD:ALL'\''\n  ssh_authorized_keys:\n  - ssh-rsa AAAAB3NzaC1yc2E..."
  }'

Get container details

curl -X GET https://api.deepinfra.com/v1/containers/{container_id} \
  -H "Authorization: Bearer $DEEPINFRA_TOKEN"
Once running, connect via SSH:
ssh ubuntu@<container-ip>

List containers

curl -X GET https://api.deepinfra.com/v1/containers \
  -H "Authorization: Bearer $DEEPINFRA_TOKEN"

Terminate a container

curl -X DELETE https://api.deepinfra.com/v1/containers/{container_id} \
  -H "Authorization: Bearer $DEEPINFRA_TOKEN"

Container lifecycle

StateDescription
creatingContainer is being initialized
startingContainer is booting up
runningContainer is active and accessible
shutting_downContainer is being terminated
failedContainer failed to start or encountered an error
deletedContainer has been permanently removed