Whoops, I literally did the same thing as this guy earlier this week, but did the testing using `claude -p` so I can identify when Claude Code would (or would not) load Skills for a particular prompt, so that I could improve the skill definition.
Who knew that using Claude to introspect on itself was against the ToS?