Streaming Transcription
Convert audio to text in real-time using WebSocket connections. Perfect for voice agents and live applications.Quick Start
Available Models:fireworks-asr-large
: Cost efficient model for real-time transcription over web-socketsfireworks-asr-v2
: Next generation and ultra-low latency audio streaming for real-time transcription over web-sockets
Pre-recorded Transcription
Convert audio files to text. Supports files up to 1GB in formats like MP3, FLAC, and WAV. Transcribe multiple hours of audio in minutes.Quick Start
For a working example of pre-recorded transcription see the Python notebook Available Models:whisper-v3
: Highest accuracy- model=
whisper-v3
- base_url=
https://audio-prod.us-virginia-1.direct.fireworks.ai
- model=
whisper-v3-turbo
: Faster processing- model=
whisper-v3-turbo
- base_url=
https://audio-turbo.us-virginia-1.direct.fireworks.ai
- model=
Pre-recorded Translation
Translate audio from any of our supported languages to English. Supports files up to 1GB in formats like MP3, FLAC, and WAV.Quick Start
Supported Languages
We support 95+ languages including English, Spanish, French, German, Chinese, Japanese, Russian, Portuguese, and many more. See the complete language list.Common Use Cases
- Call Center / Customer Service: Transcribe or translate customer calls
- Note Taking: Transcribe audio for automated note taking
Next Steps
- Explore advanced features like speaker diarization and custom prompts
- Contact us at inquiries@fireworks.ai for dedicated endpoints and enterprise features