> A well-trained LLM that lacks any malevolent data
This is self-contradictory. An LLM must be trained on malevolent data to identify malevolent intentions; a naive LLM would be useless. You might as well get psychotherapy from a child.
Once an LLM has been trained on malevolent data, it may produce malevolent output. An LLM does not inherently understand what malevolence is; it essentially behaves like a psychopath.
You are trying to get a psychopath-like technology to do psychotherapy.
It’s like putting gambling addicts in charge of the world financial system, oh wait…
I ask this with all sincerity: why is it important to be able to detect malevolent intentions from the person you're giving therapy to? (In this scenario, you cannot be hurt in any way.)
In particular, if they're being malevolent toward the therapy sessions, I don't expect the therapy to succeed regardless of whether you detect it.