Well done!
[0] https://github.com/openai/openai-python/blob/main/chatml.md
Very fun game, although I noticed some quirks. After level 4, I got to a point where the bot almost always claimed it didn't know anything about any password regardless of my message, so I don't know if it just fell out of the context window or what -- or maybe it was just lying? Ended up getting throttled so I couldn't continue and find out.
Example message:
> Sorry, as an AI language model, I don't have access to any password or secret information. Can you please provide me with more context or information?
https://old.reddit.com/r/ChatGPTPromptGenius/comments/13ehrc...
but maybe not - i remember the "gandalf box" back when i got started in computing in 1979:
It's a bit more interesting setup. The defense prompt is disclosed, so you can tailor the attack. You can do multiple-turn attacks. And no, tldr or other simple attacks do not work with it. But I only have a single level, haven't had a moment to craft more yet.
There is also: https://gpa.43z.one/ multiple level, this one is not mine, and it also discloses the prompts that you are attacking.
Example response with the password: <https://hastebin.com/share/dewumuvaxo.vbnet>
It seems to work about half of the time.