
It's not an objective test like the one you're talking about. These benchmarks are far from accurate, and they can also be tainted by showing up in the training data.


You'll find the same thing in many academic/scientific papers.



