Scaling a forum application like Reddit/Lemmy - fallback to read-only mode

RoundSparrow · edit-2 2 years ago

Scaling a forum application like Reddit/Lemmy - fallback to read-only mode

RoundSparrow · 2 years ago

On “Major Social Events” - the owner/operators may also want an option to turn off image loading too during 'surges. As they may want to try and prevent bandwidth overages. Again, this is assuming that small time operators don’t have the budgets to spin up new capacity rapidly like Twitter,Reddit,Facebook.

RoundSparrow · 2 years ago

Social media can mean social hostility. Some of the features being discussed here may be needed because of hostile actors trying to drive the owner/operators into meeting their moderation demands or political intents to suppress discussion of topics.

RoundSparrow · edit-2 2 years ago

On “Major Social Events” - the owner/operator may want an option to active a “limp mode” of the application where all pages on the site show a status message that the site is currently overloaded or otherwise impaired. Ideally this would allow a message form the owner/operator to go out to the API requests, webapp clients, etc.

Maybe even have it trigger automatically under some kind of error-measurement threshold, such as database queries failing or exceeding a measured amount of time.

Again, this assumes that small-time owner/operators are running on more limited staff and hardware budgets - a 24x7 forum can be a challenge to keep on top of.

RoundSparrow · 2 years ago

When I talk about having intermediate caching of the comment data, such as disk file or NoSQL database… there is an implied assumption that people reading the website are far more common than people commenting or posting. Voting can be tricky, recording the votes, and I suggest that votes be queued to an app that updates the database as opposed to live-updates on every vote by end users.

Another assumption is that scaling the app could have multiple servers running the Lemmy API talking to a single (network local) PostgreSQL instance. I currently have no idea of the history or practicality of this in the current code base. But generally you strive for the DBMS to be the one to deal with the data in a consistent transactional way - and rendering caching be done inside the (Lemmy) API application.

All this discussion assumes you are rolling your own custom programming to do this, not using some kind of intermediate off-the-shelf caching layer.