Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I totally buy the thesis on specialization here, I think it makes total sense.

Asides from the obvious concern that this is a tiny 8B model, I'm also a bit skeptical of the power draw. 2.4 kW feels a little bit high, but someone else should try doing the napkin math compared to the total throughput to power ratio on the H200 and other chips.

 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: