Hacker News

Aren’t books massively outweighed by the crawled internet corpus?


I doubt that, because books are probably weighted as higher quality and more trustworthy than random Reddit posts.

Especially if it's unsupervised training.



