This not expensive or a hard problem. You grab a bunch of servers, you set them up properly, and then you write your app properly.
No resume-driven bullshit; no hype-driven bullshit; no “we need to be galaxy scale now” bullshit. No email notifications, besides basic “thanks for registering, here is your login” and “here’s a password reset link.” No cloud-based bullshit. Don’t use fucking python. Use a real systems language to eek out as much performance as you can from the hardware. Actually understand databases and how your specific databases work. Use Postgres unless you have a very good reason not to.
Just a few thousand dollars a month, and a brief reprieve from short-term mania to actually think, and you too can literally serve 1 billion pages a day.
Why does everyone run into problems with this? Because they have personal hang-ups and delude themselves (or simply don’t care). This path has been tread numerous times before. The mistakes have been made thousands of times. The people who made those mistakes are available to help you out (for the right price, or if you’re good enough company).
I am sick and tired of systems engineering being grandized, when all you have to do is sit down somewhere quiet and think about the problem — with a bit of tea, and some way to access reference material.
Reddit is not a hard or interesting problem.
> Reddit is not a hard or interesting problem.
Exactly, tech is full of this weird hubris that everything has to be super complicated and over-engineered.
Heck, while you exclaim disdain for Python, I've seen large web services run on Django and a few servers behind load balancers with very few problems.
Unlike Reddit, you'll have the benefit of the hindsight of 2023 instead of managing 20 years of tech debt.
They did resort to all kinds of tricks. But your overal point still stands. The performance of python is lacking memory and it's embarrassingly slow. I hope python4 will have scripted for developing and compiled for production, like Dart. And a great compiler like Rust.
[0]https://instagram-engineering.com/static-analysis-at-scale-a...
The keyword here of course is "at scale". At what scale? Any commenter that believes what was written upthread should create a system and demonstrate that it can scale to Reddit levels.
I said large, not huge :P
I'm afraid I don't want to dox myself so I can't post publicly stuff from my employer. And I don't really have time to do what ask and write it up in my free time.
I doubt something the size of reddit would run properly on Python, but I think both mine and the commenter I replied to had the point that most sites on the internet WOULD run fine without all the bloatware and overengineering complexity. Very very few sites have the traffic that reddit does. Most websites belong to the long tail, and for those almost any tech stack would work - so why choose a needlessly complex one?
I'm sure not even HN "does only that" and even that it does with a lot of help from caching, etc
"Take my word for it, I'm just superior" the comment, which would be as fitting on r/SneerClub today SlashDot 15 years ago or Usenet 30 years ago as a dismissive geek putdown-cum-status grab, could have been summarised as.