https://crfm.stanford.edu/helm/air-bench/latest/#/leaderboard This isn’t the got... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		conception 30 days ago \| parent \| context \| favorite \| on: Claude's new constitution https://crfm.stanford.edu/helm/air-bench/latest/#/leaderboar... This isn’t the gotcha question you think it is. AI safety is being defined and measured.

viccis 30 days ago [–]

Cool, another metric to game like they do the other ones.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact