My latest attempt at this is https://github.com/simonw/claude-code-transcripts which produces output like the is: https://gisthost.github.io/?c75bf4d827ea4ee3c325625d24c6cd86...
At a minimum it will help you to be skeptical at specific parts of the diff so you can look at those more closely in your review. But it can inform test scenarios etc.
I save all of mine, including their environment, and plan to use them for iterating on my various system prompts and tool instructions.
To me, quality code is quality code no matter how it was arrived at. That should be the end of it
I get that, but I guess what I'm asking is, why does it matter what you did?
The result is working, documented source code, which seems to me to be the important part. What value does keeping the prompt have?
I'm not trying to needle, I just don't see it.
It's also great for improving my prompting skills over time - I can go back and see what worked.