Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I switched an agent from Sonnet V2 to o3-mini (default medium mode) and got strangely poor results: only calling 1 tool at a time despite being asked to call multiple, not actually doing any work, and reporting that it did things it didn't


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: