zlacker

[parent] [thread] 0 comments
1. HarHar+(OP)[view] [source] 2025-01-22 04:46:54
It seems you'd need to figure periodic updates into the operating cost of a large cluster, as well as replacing failed GPUs - they only last a few years if run continuously.

I've read that some datacenters run mixed generation GPUs - just updating some at a time, but not sure if they all do that.

It'd be interesting to read something about how updates are typically managed/scheduled.

[go to top]