zlacker

Tell HN: HN was down
1. dang+zk 2025-12-17 18:09:25
>>uyzstv+(OP)
Yes, sorry! We're investigating, but my current theory is we got overloaded because I relaxed some of our anti-crawler protections a few days ago.

(The reason I did that is that the anti-crawler protections also unfortunately hit some legit users, and we don't want to block legit users. However, it seems that I turned the knobs down too far.)

In this case, though, we had a secondary failure: PagerDuty woke me up at 5:24am, I checked HN and it seemed fine, so I told PagerDuty the problem was resolved. But the problem wasn't resolved - at that point I was just sleeping through it.
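One way to guard against resolving an alert after a single lucky page load is to require several consecutive successful checks spread over a window. The following is only a sketch of that idea: `stays_healthy`, its parameters, and the `probe` callable are hypothetical names, and nothing here reflects how HN or PagerDuty is actually configured.

```python
import time
from typing import Callable


def stays_healthy(
    probe: Callable[[], bool],
    checks: int = 5,
    interval_s: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Return True only if `probe` succeeds `checks` times in a row.

    `probe` is a hypothetical callable that returns True when the site
    answers correctly (for example, an HTTP request that gets a 200).
    A single failure ends the run early with False.
    """
    for attempt in range(checks):
        if not probe():
            return False
        # Wait between probes, but not after the final one.
        if attempt < checks - 1:
            sleep(interval_s)
    return True
```

With the defaults, the alert would be treated as resolved only after five good probes spaced a minute apart, so a site that flaps back down within that window keeps the page open.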

I'll add more as we find out more, but it probably won't be till later this afternoon PST.

Edit: later than I expected, but for those still following, the main things I've learned are (1) pkill wasn't able to kill SBCL this time - we have a script that does that when HN stops responding, but it didn't work, so we'll revise the script; and (2) how to get PagerDuty not to let you go back to sleep if your site is actually still down.
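For readers wondering what a revised kill script might look like, a common pattern is to send SIGTERM, wait a grace period, and escalate to SIGKILL if the process is still there. This is a minimal sketch under assumptions: the thread does not say which signal the existing script sends or why it failed, and `pid_alive` and `stop_process` are hypothetical helpers, not part of HN's tooling.

```python
import os
import signal
import time


def pid_alive(pid: int) -> bool:
    """Report whether a process with this PID still exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # Signal 0 checks for existence without delivering anything.
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # The process exists but belongs to another user.
    return True


def stop_process(pid: int, grace_s: float = 10.0, poll_s: float = 0.2) -> str:
    """Try SIGTERM first, then SIGKILL; return the name of the signal that worked."""
    for sig in (signal.SIGTERM, signal.SIGKILL):
        try:
            os.kill(pid, sig)
        except ProcessLookupError:
            return "already gone"
        # Poll until the process disappears or the grace period runs out.
        deadline = time.monotonic() + grace_s
        while time.monotonic() < deadline:
            if not pid_alive(pid):
                return sig.name
            time.sleep(poll_s)
    return "still running"
```

A return value of `"still running"` is the case worth alerting on: even SIGKILL does not take effect while a process is stuck in uninterruptible sleep, and a zombie that nobody has reaped will also keep showing up as alive.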

◧◩
2. shlomo+Rq 2025-12-17 18:38:06
>>dang+zk
Crazy that Dang literally manages HN in his sleep!

We all knew that but I haven't seen any confirmation before this.

◧◩◪
3. easter+Ct1 2025-12-18 00:40:25
>>shlomo+Rq
I like hacker news but I don't think this site is worth getting paged over lol
◧◩◪◨
4. archon+9W1 2025-12-18 06:01:17
>>easter+Ct1
You might be underestimating HN's popularity.
◧◩◪◨⬒
5. lockni+oY1 2025-12-18 06:29:44
>>archon+9W1
> You might be underestimating HN's popularity.

I think you're confusing popularity with criticality. I'm sure everyone in here can withstand a few hours without browsing the page.

◧◩◪◨⬒⬓
6. bayind+T72 2025-12-18 08:11:46
>>lockni+oY1
If you like the thing you're managing, then its health is critical for you, not your users.

It's dang's baby at this point, and this is a good thing, as long as HN doesn't affect his life in ways he doesn't want.

◧◩◪◨⬒⬓⬔
7. lockni+ec2 2025-12-18 08:55:33
>>bayind+T72
[flagged]
◧◩◪◨⬒⬓⬔⧯
8. bayind+Qm2 2025-12-18 10:32:49
>>lockni+ec2
I have a pretty firm grip on life and touch plenty of grass both literally and figuratively.

However, when something I care about crashes and burns once in a blue moon, I make sure to put the fire out, at least to make it survive till regular hours. Things I care about can be both business and personal, and nobody bugs me for them.

Maybe we shouldn't make any assumptions about people we don't personally know, while we are at it.

◧◩◪◨⬒⬓⬔⧯▣
9. lockni+Uw3 2025-12-18 17:09:42
>>bayind+Qm2
> However, when something I care about crashes and burns once in a blue moon, I make sure to put the fire out, at least to make it survive till regular hours.

You are free to choose what to do with your personal life.

Meanwhile, it is pretty obvious that it's pointless to demand or expect personal sacrifice to maintain unrealistic levels of availability in services that are far from critical. I mean, do you honestly believe that these messages you and I are writing are so important that someone must sacrifice their personal time to ensure they are served to the world this very instant instead of, say, 3 or 6 or 13 hours from now? Absurd.

◧◩◪◨⬒⬓⬔⧯▣▦
10. bayind+oD3 2025-12-18 17:37:01
>>lockni+Uw3
It looks like I failed to convey what I was trying to say in my first comment. Let me reiterate one more time.

    - I believe dang sees HN as his baby, so he *voluntarily* monitors it as critical infrastructure *for him*.
    - I personally like this kind of commitment from people who like their job; however, *I don't expect or demand it in any way*.
    - I also hope that attention doesn't affect his life, *especially negatively and/or in a crippling way*.

I don't care whether this site is down for 6 seconds or 6 hours. I just wanted to commend him for liking what he's doing this much. I demand nothing from any service provider I use, whether it's a small one-person operation, dang, or Amazon/Google.

I also keep servers up in my day job, and some are more important than others, but none of them requires me to wake up at 5 AM to solve a problem (by design). So I don't demand from others anything I wouldn't do myself.

As long as nobody is dying, nobody should stop, drop, and work on something else regardless of time, date and location.
