For AI a random bit flip doesn't matter much.
Try doing fault injection on a chip some time. You'll see it's significantly easier to cause a crash / reset / hang than to just flip data bits.
'rad-triggered bit flips don't matter with AI' is a lie spoken by people who have obviously never done any digital design in their life.
I would say they probably something a little beefier than consumer hardware and just deal with lots of failures and bit flips.
But cooling is a bigger issue probably?