That's true, but it looks like it's been updated since then because the benchmar...

		yismail 17 days ago \| parent \| context \| favorite \| on: Measuring AI Ability to Complete Long Tasks That's true, but it looks like it's been updated since then because the benchmarks include Claude Opus 4.5