Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
JohnnyMarcone
11 days ago
|
parent
|
context
|
favorite
| on:
Measuring AI Ability to Complete Long Tasks
Thanks for this comment. I've been trying to find anything about the huge error bars. Do you have any sources you can share for further reading?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: