Kind of like recreating your image one object at a time. It might not be exact, but close enough.
Best you can do is to mask and keep inpainting the area that looks different until it doesn't.
At some point the input must be considered part of the work. At the limit you could just describe every pixel, but that certainly wouldn’t mean the model contained the work.