
Recording audio with FFmpeg and transcribing a file piped from FFmpeg both work.

Given that it took 19.64 minutes to transcribe the 11-second sample WAV, it’s possible I just didn’t wait long enough :)
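To put that number in perspective, here's a quick back-of-the-envelope calculation of the real-time factor (processing time divided by audio duration). Only the 19.64 min and 11 s figures come from the comment above; the variable names are just for illustration.

```python
# Figures quoted in the comment above.
transcribe_minutes = 19.64
audio_seconds = 11.0

# Convert to seconds and divide by the clip length.
transcribe_seconds = transcribe_minutes * 60
real_time_factor = transcribe_seconds / audio_seconds

print(f"{transcribe_seconds:.1f} s of compute for {audio_seconds:.0f} s of audio")
print(f"real-time factor: about {real_time_factor:.0f}x slower than real time")
```

At roughly 107x slower than real time, even a few seconds of audio would take minutes to come back, which fits the "didn't wait long enough" theory.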



Ah. In that case... yeah. Is it using the GPU, and does the whole model fit in your (V)RAM?


This is a CPU-only implementation.


Oh, that's interesting. The readme talks about GPU acceleration on Apple Silicon, and I didn't see anything explicit for other platforms, so I assumed it needs a GPU everywhere. But it uses BLAS acceleration, which a web search suggests is just a CPU-optimized math library. That's great; it should really increase the number of places where it's useful :)


It should be possible to develop a cuBLAS backend to accelerate BLAS on Nvidia.
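
For anyone wondering what a BLAS backend actually accelerates: the workhorse routine is GEMM, which computes C = alpha * A * B + beta * C, and cuBLAS provides the same kind of routine on Nvidia GPUs. Below is a plain-Python reference sketch of that operation. It only shows the math; the function name `gemm` and the list-of-rows layout are my own choices, not this project's code or any library's API, and a real BLAS is vastly faster.

```python
def gemm(alpha, a, b, beta, c):
    """Return alpha * (a @ b) + beta * c for matrices given as lists of rows."""
    rows, inner, cols = len(a), len(b), len(b[0])
    # Basic shape checks: a is rows x inner, b is inner x cols, c is rows x cols.
    assert all(len(row) == inner for row in a)
    assert len(c) == rows and all(len(row) == cols for row in c)

    # Start from beta * c, then accumulate the scaled product.
    out = [[beta * c[i][j] for j in range(cols)] for i in range(rows)]
    for i in range(rows):
        for k in range(inner):
            a_ik = alpha * a[i][k]
            for j in range(cols):
                out[i][j] += a_ik * b[k][j]
    return out


a = [[1.0, 2.0], [3.0, 4.0]]
b = [[5.0, 6.0], [7.0, 8.0]]
c = [[1.0, 1.0], [1.0, 1.0]]
result = gemm(1.0, a, b, 0.5, c)
print(result)
```

A backend swap in this sense means handing the same operation to a different library, so the calling code would presumably not need to change much.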



