zlacker

1. k8sToG+(OP)[view] [source] 2025-12-05 16:10:40
It's not about outages. It's about the why. Hardware can fail. Bugs can happen. But to continue a rollout despite warning signs and without understanding the cause and impact is on another level. Especially if it is related to the same problem as last time.
replies(1): >>udev40+3b
2. udev40+3b[view] [source] 2025-12-05 16:56:14
>>k8sToG+(OP)
And yet, it's always clownflare breaking everything. Failures are inevitable, as is widely known; that is why we build resilient systems to overcome them.
replies(1): >>deadba+Fh
3. deadba+Fh[view] [source] [discussion] 2025-12-05 17:23:12
>>udev40+3b
It is healthy for tech companies to have outages, as they will build experience in resolving them. Success breeds complacency.
replies(1): >>wizzwi+i01
4. wizzwi+i01[view] [source] [discussion] 2025-12-05 20:48:41
>>deadba+Fh
You don't need outages to build experience in resolving them, if you identify conditions that increase the risk of outages. Airlines can develop a lot of experience resolving issues that would lead to plane crashes, without actually crashing any planes.