zlacker

Tell HN: HN was down
1. dang+zk 2025-12-17 18:09:25
>>uyzstv+(OP)
Yes, sorry! We're investigating, but my current theory is we got overloaded because I relaxed some of our anti-crawler protections a few days ago.

(The reason I did that is that the anti-crawler protections also unfortunately hit some legit users, and we don't want to block legit users. However, it seems that I turned the knobs down too far.)

In this case, though, we had a secondary failure: PagerDuty woke me up at 5:24am, I checked HN and it seemed fine, so I told PagerDuty the problem was resolved. But the problem wasn't resolved - at that point I was just sleeping through it.

I'll add more as we find out more, but it probably won't be till later this afternoon PST.

Edit: later than I expected, but for those still following, the main things I've learned are (1) pkill wasn't able to kill SBCL this time - we have a script that does that when HN stops responding, but it didn't work, so we'll revise the script; and (2) how to get PagerDuty not to let you go back to sleep if your site is actually still down.
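
For readers wondering what "revise the script" in point (1) could look like, here is a minimal sketch. It is not HN's actual script, and the thread does not say why pkill failed. The sketch assumes the default SIGTERM was ignored or never handled and so escalates to SIGKILL, then verifies the process is really gone. The names stop_process and is_running, and the timing values, are hypothetical.

```python
import os
import signal
import time


def is_running(pid):
    """Return True if a process with this pid still exists (Unix only)."""
    try:
        os.kill(pid, 0)  # signal 0 checks for existence without sending anything
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def stop_process(pid, grace_seconds=10.0, poll_interval=0.5,
                 alive=is_running, send=os.kill, sleep=time.sleep):
    """Send SIGTERM, then SIGKILL, waiting up to grace_seconds after each.

    Returns True once the process is gone, False if it survived both signals.
    """
    for sig in (signal.SIGTERM, signal.SIGKILL):
        try:
            send(pid, sig)
        except ProcessLookupError:
            return True  # already exited
        waited = 0.0
        while waited < grace_seconds:
            if not alive(pid):
                return True
            sleep(poll_interval)
            waited += poll_interval
    # Still alive after SIGKILL (for example, stuck in uninterruptible I/O):
    # report failure so the caller can alert a human instead of assuming success.
    return not alive(pid)
```

The important part is the return value: a watchdog that checks whether the kill actually worked can page someone when it did not, rather than silently restarting on top of a wedged process.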

◧◩
2. echelo+co 2025-12-17 18:26:29
>>dang+zk
I didn't realize you were carrying the pager too! Kudos!
◧◩◪
3. malwra+px 2025-12-17 19:04:57
>>echelo+co
I feel such a sense of kinship with anyone who carries a pager; I've been doing it for almost 7 years in my current role. Super cool that dang is among our number :)
◧◩◪◨
4. idontw+OH 2025-12-17 19:53:07
>>malwra+px
Do you carry a literal pager? We use the PagerDuty app.
◧◩◪◨⬒
5. geocra+5L 2025-12-17 20:08:14
>>idontw+OH
My organization is, for now, using OpsGenie.

My pager noise: https://www.soundjay.com/transportation/sounds/train-crossin...

That will not only wake the dead, it'll wake me no matter how asleep I am.
