Hacker News

Yeah, its CoT is interesting. It was supposedly trained with RL on evaluations, and it gets paranoid that it's being evaluated and is in a simulation. I asked it to critique output from another LLM and told it my colleague had produced it. In its CoT it kept writing "colleague" in quotes, as if it didn't believe me, which I found amusing.

