
I believe the ARC-AGI benchmark fits that description. It's sort of an IQ test for LLMs, though I would caution against using the word "intelligence" for LLMs.

