DeepInfra hosts Whisper and other speech recognition models. Given an audio file, they produce transcribed text with per-sentence timestamps. Browse all speech recognition models.Documentation Index
Fetch the complete documentation index at: https://docs.deepinfra.com/llms.txt
Use this file to discover all available pages before exploring further.
Models
openai/whisper-large— best accuracyopenai/whisper-medium,openai/whisper-small,openai/whisper-base— faster, lighteropenai/whisper-timestamped-medium— per-word timestamp segmentation
Example
Supported audio formats
mp3wav