Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In this sentence, Anthropic makes clear that "be hurtful" and "lead to public embarrassment" are separate and distinct. Otherwise it would not be necessary to specify both. I don't think this is the signal they should be sending the model.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: