Those two were two different models (Kura and jace.ai), and one model being SOTA... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		cubefox 11 months ago \| parent \| context \| favorite \| on: Operator research preview Those two were two different models (Kura and jace.ai), and one model being SOTA at one benchmark doesn't make it SOTA overall. Moreover, both are specific for browser use, so they don't operate only on raw pixels but can read HTML/DOM, unlike general computer use models which rely on raw screenshots only.

timabdulla 11 months ago [–]

I think I hit all those points in my previous post, except for the fact that it's two different models, as you've noted. That said, neither of them seem to report scores for the other benchmark in each particular case.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact