
We flag every use of any, and the LLM has to fix each one before our check succeeds. It does, and this works fine.
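
For anyone wanting to try this, here is a rough sketch of what such a check could look like, assuming the code base is TypeScript and the check runs over source text. The names findExplicitAny and checkPasses are made up for illustration, and the regex is only an approximation. A real setup would more likely rely on the typescript-eslint rule no-explicit-any than on a hand-rolled scan.

```typescript
// Hypothetical guardrail: report explicit `any` usages in TypeScript source text.
// Regex-based sketch only: it does not parse the code, so it can misfire
// on string literals or comments that happen to contain ": any".

interface AnyViolation {
  line: number; // 1-based line number of the offending line
  text: string; // trimmed source line, for the error report
}

// Matches `: any`, `as any`, and `<any>` with `any` as a whole word.
const EXPLICIT_ANY = /(:\s*any\b|\bas\s+any\b|<\s*any\s*>)/;

function findExplicitAny(source: string): AnyViolation[] {
  const violations: AnyViolation[] = [];
  source.split("\n").forEach((text, index) => {
    if (EXPLICIT_ANY.test(text)) {
      violations.push({ line: index + 1, text: text.trim() });
    }
  });
  return violations;
}

// The check succeeds only when no violations remain, so the LLM
// has to keep editing until this returns true.
function checkPasses(source: string): boolean {
  return findExplicitAny(source).length === 0;
}
```

The violation list (line number plus offending line) can be fed back to the LLM as the failure message, which gives it something concrete to fix on the next attempt.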


Currently starting to do the same on seer's frontend. I didn't realise how simple yet effective this guardrail could be!



