It should be easy for a company like Anthropic to prove this beyond a doubt. Why... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		irthomasthomas 16 hours ago \| parent \| context \| favorite \| on: Statement on US government directive to suspend ac... It should be easy for a company like Anthropic to prove this beyond a doubt. Why don't they? Why don't they have a collection of prompts and side-by-side comparisons with other models showing how far ahead they are?
		help

largbae 15 hours ago [–]

I think it's mainly because the difference in models at the frontier isn't "response to prompt X", but rather "coherence with 500K tokens of context and instructions in play"

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact