Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That seems promising for applications that require raw speed. Wonder how much they can scale it up - 8B model quantized is very usable but still quite small compared to even bottom end cloud models.
 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: