Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
model name for deepinfra (username/mode-name format)
The type of GPU the deployment is running on
L4-24GB, L40S-48GB, A100-80GB, H100-80GB, H200-141GB, B200-180GB, B300-270GB, RTXPRO6000-96GB, other Number of GPUs used by one instance
1 <= x <= 8Maximum number of concurrent requests
1 <= x <= 256Base public model
Extra command line arguments for custom deployments
Successful Response
Deploy Id
"fkj843kjh8"
Model Id from huggingface
"google/vit-base-patch16-224"
Model version
"d8b79b422843bd59d628bf25b01aded94a9ec1a9b917e69fe460df9ff39ec42b"
Task
"image-classification"
Status
"deployed"
Failure reason
"Initialization failed"
Created at
"2021-08-27T17:19:21+00:00"
Updated at
"2021-08-27T17:19:21+00:00"
legacy, llm, lora, tts Details about number of instances running right now
Immutable deploy configuration
Scale Settings