zlacker

[return to "HN is up again"]
1. sillys+z[view] [source] 2022-07-08 20:34:23
>>tpmx+(OP)
HN was down because the failover server also failed: https://twitter.com/HNStatus/status/1545409429113229312

Double disk failure is improbable but not impossible.

The most impressive thing is that there seems to be no dataloss, almost whatsoever. Whatever the backup system is, it seems rock solid.

◧◩
2. davedu+b2[view] [source] 2022-07-08 20:41:05
>>sillys+z
> Double disk failure is improbable but not impossible.

It's not even improbable if the disks are the same kind purchased at the same time.

◧◩◪
3. kabdib+iv[view] [source] 2022-07-08 22:34:21
>>davedu+b2
I once had a small fleet of SSDs fail because they had some uptime counters that overflowed after 4.5 years, and that somehow persistently wrecked some internal data structures. It turned them into little, unrecoverable bricks.

It was not awesome seeing a bunch of servers go dark in just about the order we had originally powered them on. Not a fun day at all.

◧◩◪◨
4. mikiem+Lb1[view] [source] 2022-07-09 03:05:29
>>kabdib+iv
You are never going to guess how long the HN SSDs were in the servers... never ever... OK... I'll tell you: 4.5years. I am not even kidding.
◧◩◪◨⬒
5. muttan+yp3[view] [source] 2022-07-09 22:01:35
>>mikiem+Lb1
It's concerning that a hosting company was unaware of the 40,000 hour situation with SSD it was deploying. Anyone in hosting would have been made aware of this, or at least should have kept a better grip on happenings in the market.
◧◩◪◨⬒⬓
6. dogeco+wJ3[view] [source] 2022-07-10 01:10:29
>>muttan+yp3
Yeah, this is why you run all equipment in a test environment for 4.5 years before deploying it to prod. Really basic stuff.
[go to top]