This isn't really true.
In this case it's specific to NVidia's tensor matrix multiply-add (MMA) instructions, which lets it use silicon that would otherwise be unusued at that point.
> Why does publishing papers require the latest and greatest GPUs?
You really do need to test these things on real hardware and across hardware. When you are doing unexpected things there are lots of unexpected interaction effects.
As a reminder, the context is "require the latest and greatest GPUs", responding to the parent comment. "General" doesn't mean "you can do this on an Intel Arc GPU" level of general.
That said, my comment could have used a bit more clarity.