Deploy Llm Presets
Dedicated Models
Deploy Llm Presets
DeepInfra-tested preset deploy configs for hf_repo_id (HF org/name), for
the deploy dashboard to pre-fill. An empty list — the common case — means none.
Omit engine to get presets across all engines.
GET
Deploy Llm Presets
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Query Parameters
Available options:
L4-24GB, L40S-48GB, A100-80GB, H100-80GB, H200-141GB, B200-180GB, B300-270GB, RTXPRO6000-96GB, other Response
Successful Response
Preset id.
Allowed Nx hardware configs.
Source of this config (e.g. deepinfra).
Inference engine the preset was tuned for.
Preset engine tuning knobs.
Short display name for the preset (e.g. "Throughput-optimized").