Beyond the Basics: Designing for a Million Users
Posted by javinpaul@reddit | programming | View on Reddit | 10 comments
Posted by javinpaul@reddit | programming | View on Reddit | 10 comments
st4rdr0id@reddit
Do the authors know about software-defined datacenters? In a datacenter everything is virtual now, computing power, storage and networking. There is no "single server" in a provider like AWS, the load is being assigned to physical resources in a transparent manner.
The entire article feels like "just use cloud services". The master template is highly debatable, I won't say you need by default things like search, event queues, a data warehouse, a no Redis and a video microservices cluster.
Romeo_5@reddit
There is no “single server” anymore, yes. But they are assigned virtual compute spaces that can be bought, no? And if your request loads exceeds the CPU and RAM, you’d have to amp up the virtual server, that if you’ve not raked up AWS billing.
Also, the master template isn’t suggesting a silver-bullet solution, or telling readers to put all those scaling components together in their web applications, it’s rather a hypothetical assumption than a real one.
Ideal solutions would be to add those components once you noticed lag or failure.
maxinstuff@reddit
You don’t really need much scale for a million users on a non-critical app.
I see often people making the mistake that they think a million or even tens of millions of records is a lot of data - it’s just not. This is rife in enterprise — everything is over engineered and yet somehow at the same time extremely poorly optimised :/
Silound@reddit
I feel a major point that the author glassed over was the differentiation between scaling for simultaneous requests and scaling simply based on an arbitrary quantity, like number of users. Reading the article makes it obvious that the author is talking about the former, but it's misleading for someone who doesn't understand the difference.
I can have an app with ten thousand users that requires massive scaling due to request rate, request size, geophysical locations, processing time, or any number of other static factors. I can equally have an app with ten million users that requires very little scaling because it only has to handle a few small requests per user, per day, and the requests are well distributed throughout the day' time.
Scalability is just like any other practice or pattern in practical software and application design: it's good to design things with future scalability in mind, but if you don't need it now and there's no obvious need on the horizon, then there's no reason to waste time with implementation. Unless you're somehow blessed with an overabundance of dev time, in which case, what paradise are you living in and how do I get there?
No_Technician7058@reddit
idk i think this article is cooked.
I'm not really sure you need all that much to support 1 million users creating and reading each others posts.
I think a relational database on a big server, plus caching, basically would take you there.
most users might read their feeds a few times a day, most would be making one post or less per day, may only be liking 3 or 4 things a day, may not even open the app more than 2 or 3 times a week.
you don't need webscale architecture for this kind of thing. some clever table partitioning would get you there. all this extra cruft early on is only going to get in the way of getting the "core" right.
janyk@reddit
I'm making an educated guess and claiming that not even Instagram has cross-region transactions. I know because when I, sitting at home in North America, view my Japanese friends' Instagram profiles two days in a row, on the second day I often see a new post which I didn't see the day before but is 7 days old!
arwinda@reddit
That's just the algorithm, not cross region transactions.
nicholashairs@reddit
I only scan read the article, but I think the "real" architecture is the one at the bottom not the top.
Which looks much more reasonable.
GayMakeAndModel@reddit
You know, every time someone says users in situations like this, I have to ask: what is a user? What does the average user do? Are programs also users? On coworker responded with: well, what is love?
babige@reddit
Shiitt a million users, give me drf-django, a beefy postgres server, caching, a few ec2's, kamal2, and a cdn.