zlacker

[parent] [thread] 0 comments
1. wat100+(OP)[view] [source] 2026-02-01 18:15:24
I haven’t tried it in a while, but LLMs inherently don’t distinguish between authorized and unauthorized instructions. I’m sure it can be improved but I’m skeptical of any claim that it’s not a problem at all.
[go to top]