Audio & Voice
Whisper AI
Open-source speech recognition for multilingual transcription and translation with AI.
Whisper AI by OpenAI is a robust ASR system supporting 99 languages with accent resilience. Ideal for researchers and app developers, it enables accurate transcriptions and translations from audio/video content.
Parakeet-tdt-0.6b-v2: A 600M-parameter ASR model for accurate English transcription with punctuation, capitalization & timestamp prediction. Handles 24-min audio efficiently.