
Google has papers on on-device speech recognition; the models are used in the keyboard and for Live Caption on Pixel devices.


This article from the Google AI blog about Gboard's speech recognition is really interesting: https://ai.googleblog.com/2019/03/an-all-neural-on-device-sp...


They are trained on a ton of non-public data though, and I’m not sure whether pre-trained models are available.


Nope, they aren't available. CC YouTube videos with captions, or radio broadcasts with transcripts, could prove helpful as data for multiple languages, and could also make it possible to build a multilingual ASR system.



