Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

When you say

>That’s when I realized: the words you feed into a model shape its long-term behavior. Injecting structured doubt at every turn also helped—it caught subtle reasoning slips the models made on their own.

Was that not obvious working with LLLM's from the first moment? As someone running their own version of Vending-Bench, I assume you are above-average in working with models. Not trying to insult or anything, just wondering what the mental model you had before was and how it came to be, as my perspective is limited only to my subjective experiences.



Good question! It was not that I didn’t understand prompt influence. It’s that I underestimated its persistence over a long time horizon.


Ahhhh okay, makes sense, thanks for answering.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: