ЛУЧШАЯ ЧАСТЬ: они опубликовали все 1 МИЛЛОН часов данных на Hugging Face 🤯
Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastav18 авг. 2025 г.
NVIDIA ON A ROLL! Canary 1B and Parakeet TDT (0.6B) SoTA ASR models - Multilingual, Open Source 🔥 - 1B and 600M parameters - 25 languages - automatic language detection and translation - word and sentence timestamps - transcribe up to 3 hours of audio in one go - trained on 1 Million hours of data - SoTA on Open ASR Leaderboard - CC-BY licensed 💥 Available on Hugging Face, go check them out today!
62,9K