Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

About whisper, anyone knows a project (github) about using the model in real-time? I'm studying a new language, and it appears to be a good chance to use and learning pronunciation vs. word.


This one uses faster-whisper as the backend, I've tried with small model and the performance is good. https://github.com/collabora/WhisperLive

The is another one that uses huggingface's implementation, but I haven't tried it since my spec doesn't support flash-att2 https://github.com/luweigen/whisper_streaming


Thanks. I'll try.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: