I already get rate-limited like crazy on lemmy and there are only like 60,000 users on my instance. Is each instance really just one server or are there multiple containers running across several hosts? I’m concerned that federation will mean an inconsistent user experience. Some instances many be beefy, others will be under resourced… so the average person might think Lemmy overall is slow or error-prone.

Reddit has millions of users. How the hell is this going to scale? Does anyone have any information about Lemmy’s DB and architecture?

I found this post about Reddit’s DB from 2012. Not sure if Lemmy has a similar approach to ensure speed and reliability as the user base and traffic grows.

https://kevin.burke.dev/kevin/reddits-database-has-two-tables/

  • Max-P@lemmy.max-p.me
    link
    fedilink
    English
    arrow-up
    106
    arrow-down
    1
    ·
    2 years ago

    Bigger instances will indeed run multiple copies of the various components, it’s pretty standard software in that regard.

    Usually at first that will start by moving the PostgreSQL database to its own dedicated box, and then start adding additional backend boxes, possibly adding more caching in front so that the backend doesn’t have to do as much work. Once the database is pegged, the next step is usually a write primary and one or more read secondaries. When that gets too much, you get into sharding so that you can spread the database load across multiple servers. I don’t know much about PostgreSQL but I have to assume it’s better than MySQL in that regard and I’ve seen a 1 TB MySQL database in the wild running just fine.

    I think lemmy.world in general is hitting some scalability issues that they’re working on. Keep in mind the software is fairly new and is just being truely tested at large scale, there’s probably a ton of room for optimization. Also lemmy.world is still on 0.17 and apparently 0.18 changed the protocol a lot in a way that makes it scale much better, so when they complete that upgrade it’ll probably run a lot better already.


    The part that worries me about scalability in the long term is the push nature of ActivityPub. My server is already getting several POST requests to /inbox per second already, which makes me wonder how that’s gonna work if big instances have to push content updates to thousands of lemmy instances where most of the data probably isn’t even seen. I was surprised it was a push system and not a pull system, as pull is much easier to scale and cache at the CDN level, and can be fetched on demand for people that only checks lemmy once in a while.

    I need to start digging into Lemmy’s code and get familiar with the internals, still only a couple days in with my private instance.

    • Schooner
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 years ago

      Would it be feasible to change it to a pull system at this point? I don’t think the Lemmy part of that is a problem, but ActivityPub may need to make big changes and I’m not sure how practical that is.

      • DaEagle
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 years ago

        It doesn’t have to change, it can support both, so lemmy-lemmy can be pull, and lemmy-kbin can be push.

        Obviously not trivial, but definitely doable.

      • terribleplan@lemmy.nrd.li
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 years ago

        That would have to be a completely new version of ActivityPub, and would likely render it non-backwards-compatible (or at least things would have to still do all the old version stuff to interop with anything not on it). This has happened before (see OStatus and Diaspora vs current things using ActivityPub).

        OStatus was based on Atom (like RSS) and WebStreams (aka PubSubHubub), which was basically a pull system with a real-time notification layer on top (that could offload the fan-out work of notifications to more centralized PSH servers), and things moved away from that in favor of the more real-time ActivityPub protocol.

        I mean, I hate XML as much as the next guy, but I think there was maybe a little too much baby (general architecture) in with that bathwater (specifics of Atom/XML/Microblogging-oriented stuff).

    • aaron@lemm.ee
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      2
      ·
      2 years ago

      The implementation as far as I understand it is plain stupid. It prevents small instances from participating at any significant scale and seems happy to just drop data over the wire without reconciling. Whoever designed this needs to be hired at Reddit.

      • Max-P@lemmy.max-p.me
        link
        fedilink
        English
        arrow-up
        7
        ·
        2 years ago

        I’m not sure about the data being dropped, my instance was misconfigured for a day or two, and as soon as I fixed it, the data came right in. Instances repeatedly trying to push data to my instance is what clued me in that something was missing from my NGINX config. It backfilled pretty fast.

        Although I wouldn’t mind if there was a fallback pull mechanism to remediate failed pushes.

        • aaron@lemm.ee
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          2 years ago

          Interesting. Curious if you have a better understanding of ActivePub - do you happen to know if the protocol guarantees synchonicity and what mechanism guarantees it?

          • Max-P@lemmy.max-p.me
            link
            fedilink
            English
            arrow-up
            3
            ·
            2 years ago

            I don’t, really going off on ~1 week of running my essentially single user instance and watching it do its thing. I need to read the spec and experiment with it when I have some more free time.

            Pure speculation but my guess would be that the servers are expected to retry for a certain amount of time. I know there’s been some tickets opened for some big instances going out of sync with eachother and fixes being worked on to address those. I don’t know if it only fixes it forward or if that also backfills.

            Also nothing preventing Lemmy from implementing a fallback way of doing a resync if it detects drift. “Hey lemm.ee, I lost everything since an hour ago, backfill please”.

            • aaron@lemm.ee
              link
              fedilink
              English
              arrow-up
              1
              ·
              edit-2
              2 years ago

              Yeah, or batching changes and confirming receipt with a hash, or doing pull instead of push. From what I’ve been reading, the design seems a little janky.

    • FarceMultiplier@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      I’m thinking something like a large postgresQL database box, then front facing servers running ldirectord or a similar HA load balancer to be able to add instances as necessary. However, my skills here are 20 years out of date so I’m sure there’s better out there.

      • Max-P@lemmy.max-p.me
        link
        fedilink
        English
        arrow-up
        0
        ·
        2 years ago

        Some people are already deploying it with Kubenetes, pretty much handles load balancing and even scaling up and down automatically out of the box if you’re set up in the cloud. Pretty much a long solved problems.

        Lots of nice free load balancers these days: HAproxy, NGINX, Traefik, I’ve even seen people do it in kernel with eBPF or iptables.

    • weeezes@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 years ago

      The part that worries me about scalability in the long term is the push nature of ActivityPub. My server is already getting several POST requests to /inbox per second already, which makes me wonder how that’s gonna work if big instances have to push content updates to thousands of lemmy instances where most of the data probably isn’t even seen. I was surprised it was a push system and not a pull system, as pull is much easier to scale and cache at the CDN level, and can be fetched on demand for people that only checks lemmy once in a while.

      I think there’s a benefit to the push model, as the instances can prioritize who to push to first if there’s scaling issues, instead of having to throttle GETs, effectively the end result is anyway the same that nothing ends up to other instances in real time (which is fine). I don’t know how lemmy works exactly, but could the push model just be a detail of activitypub https://flak.tedunangst.com/post/what-happens-when-you-honk ?

      • Max-P@lemmy.max-p.me
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 years ago

        It definitely is part of the ActivityPub protocol, but I only glanced at the spec so far. I should probably follow that link and implement a toy ActivityPub app to get more familiar with how it works.

        I think there’s a benefit to the push model, as the instances can prioritize who to push to first if there’s scaling issues, instead of having to throttle GETs,

        The downside to this is smaller instances are penalized in that scenario, which would in turn could cause users to flock to megainstances until it becomes centralized again.

        As I said, GETs are cacheable, so if one slaps Cloudflare in front you can handle millions of GETs for relatively cheap.

        Maybe it’s batched however? I really need to read the spec. Pushing to thousands of servers every 1/5/10 minutes certainly would give a fair amount of headroom to make it work I guess.

  • HobbitFoot @thelemmy.club
    link
    fedilink
    English
    arrow-up
    55
    arrow-down
    9
    ·
    2 years ago

    Poorly. Lemmy will scale poorly.

    I won’t be surprised if the larger instances start locking down more as a way to sustain themselves, like restricting communities or only allowing text posts.

    • nyakojiru@lemmy.world
      link
      fedilink
      English
      arrow-up
      9
      ·
      2 years ago

      Sometimes you have just to accommodate to the situation and keep going until it settles down. The error I think here is thinking something can’t have flaws and issues, even more if it’s not behind a corporations. And no one wants corporations.

      • HobbitFoot @thelemmy.club
        link
        fedilink
        English
        arrow-up
        10
        ·
        2 years ago

        It isn’t about accommodating to the situation, but planning for long term growth.

        Right now, instances of Lemmy don’t have any way to fund server costs other than asking for donations. Outside of Wikipedia, that isn’t a sustainable business model. How is Lemmy supposed to survive if, every time a sub gains critical mass, it shuts down?

        • ritswd@lemmy.world
          link
          fedilink
          English
          arrow-up
          7
          ·
          edit-2
          2 years ago

          planning for long-term growth

          Which is part of any scaling effort, and you can’t really guess through predicting and resolving bottlenecks, it takes some serious expertise. And as far as I know, the Lemmy devs have never built a high-scale service before, and I think that is possibly the single biggest risk to the growth and success of the Lemmy project in general.

          Source: that’s my job, I’ve been doing that for some of the most high-scale services in the world for about a decade. I absolutely could help, actually I’d love to, but I definitely won’t under current Lemmy leadership, for reasons: https://lemmy.world/comment/596235

          • HobbitFoot @thelemmy.club
            link
            fedilink
            English
            arrow-up
            3
            ·
            2 years ago

            I get the feeling that the development leadership is going to have to change, with either the current developers bringing on leadership that can manage the growth or someone forks Lemmy and that becomes the default. The current developer model, like the current admin monetization model, can’t stand as is in its current form.

            • ritswd@lemmy.world
              link
              fedilink
              English
              arrow-up
              2
              ·
              2 years ago

              That is what I’m rooting for. Alas I can imagine how they might cling to it, even if at the detriment of the project. Or not, I don’t particularly have any information. My fingers are crossed…

            • ritswd@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              arrow-down
              1
              ·
              2 years ago

              I think Kbin is something good being built by good people, I get what they’re trying to do, but unfortunately I don’t have a lot of faith that it will turn out to be a successful project.

              In terms of technical scaling, I’m puzzled that they went with an interpreted language if the goal is scale. I get that the basic usage of Kbin’s features may not require a ton of CPU-heavy operations, or a fine handling of the memory; but once it meets sufficient scale, there will have to be some scale edge-case bottlenecks where you’ll want to step out of the beaten path and get lower-level, so I’m a bit confused about why they chose a technology that will make those harder to get past rather than easier. PHP is great for rapid prototyping, but I’d argue that’s not what the vision should be here.

              About community scale, I’m not expert, but they seem to really care to offer a karma system; and we’ve seen the karma-farming behavior that this has been incentivizing on Reddit. I don’t see why it would be any different here if enough people end up joining. Lemmy is intentionally not offering a karma system, and it really feels like the healthier move long-term.

              I think all it would take would be for the Lemmy devs to admit that they’re in over their heads, and that their political affiliations have been a hindrance to the project, to the point that they transition the governance of it to other people. I really hope they do that. If they do soon enough, they’re so far ahead and built on so much more long-term thinking, that I think it would pretty much make Kbin kinda obsolete. I have no special information about this, so I could be wrong, and I hope for them that I am; but I can see that as a pretty likely outcome.

              (That, and on the shorter-term, I wouldn’t contribute to a product I don’t use, and I can’t use it for now because my usage is 100% mobile, and the current lack of API means no native client. I wish the mobile web was better than it is as an application platform…)

              • V4uban@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                2 years ago

                Your point for Lemmy passing the governance of their project could happen. Or, as the number of other contributors increase, people will think that the impact of the two creators is diluted.

                Or, at some point, someone creates a fork of Lemmy still based on ActivityPub, still compatible with both Lemmy and Kbin, but without any of Lemmy’s political background ’

              • Hexadecimalkink
                link
                fedilink
                English
                arrow-up
                2
                arrow-down
                1
                ·
                2 years ago

                Have you found that their political leanings have affected you in any way? Just curious if you have some sort of bias that’s making you think people on the left can’t produce efficient software.

                • ritswd@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  edit-2
                  2 years ago

                  It hasn’t. But letting terrible people have power affects the world in normalizing violence and hatred. It’s not about left or right, if they were American racists against Chinese people, I would have the exact same problem. I’m personally quite on the left, but without the hate.

                  I am living safe and not being targeted with hateful violence like the Uyghurs or North Koreans are, so this is far, far more important than what can affect me.

        • notavote@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          1
          ·
          2 years ago

          It is not like any other social network has become sustainable business. Reddit, Twitter, YouTube, FB all are net losers with all trials with and selling user data.

          We can safely say that after almost 20 we still don’t have sustainable business model for soc networks.

          Let’s try with donations.

    • Mane25@feddit.uk
      link
      fedilink
      English
      arrow-up
      5
      ·
      edit-2
      2 years ago

      Wouldn’t that create a natural balance though? A large instance starts struggling so people are incentivised to move to smaller instances or start new instances and so spread the load more evenly. That’s how it would scale. I’m surprised how many of the larger instances haven’t closed signups yet but that wouldn’t be a bad thing if they did.

      • HobbitFoot @thelemmy.club
        link
        fedilink
        English
        arrow-up
        5
        ·
        2 years ago

        The issue isn’t on the user end, but the sub end since that is where all the data is stored.

        So, according to your proposal, the best thing a sub should do when it is getting popular is to go private with its existing subscribers and any new people who want to participate should go create their own sub in a different instance.

        • Mane25@feddit.uk
          link
          fedilink
          English
          arrow-up
          1
          ·
          2 years ago

          I wasn’t talking about subs, I’m talking about when an instance gets too popular. Ideally you’d want lots of small instances, ideally communities should be spread evenly as well and if your users are spread out that should happen more or less naturally.

    • aaron@lemm.ee
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 years ago

      When the protocol favors monoliths, we’re right back to the Reddit problem

      • GoodEye8@lemm.ee
        link
        fedilink
        English
        arrow-up
        4
        ·
        2 years ago

        Scalability doesn’t mean “favoring monoliths”. It’s just scalability and honestly, 60k users shouldn’t bring a service down. 60k users is not even close to being a monolithic instance.

        • aaron@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          2 years ago

          Scalability does mean favoring monoliths because it costs money to scale and scaling here isn’t proportional to your instance’s users, it’s proportional to the size of the entire network.

          60k users is today, not tomorrow. I’m thinking forward to 6000k users.

    • Artemis@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      Or capping their users at a sustainable amount. We may see these original slew of instances become the OGs, only opening for new users when MAUs drop past a certain threshold. Won’t really negatively impact the fediverse cause others can spin up new instances when demand… demands so lol

      • HobbitFoot @thelemmy.club
        link
        fedilink
        English
        arrow-up
        4
        ·
        2 years ago

        I don’t think it is the users that are the issue, but the subs. If a sub gets busy enough, are the server admins going to force the sub to go private? Is a sub going to splinter across several other instances, each vying to be the true successor?

    • nyakojiru@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      Sometimes you have just to accommodate to the situation and keep going until it settles down. The error I think here is thinking something can’t have flaws and issues, even more if it’s not behind a corporations. And no one wants corporations.

    • nyakojiru@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      Sometimes you have just to accommodate to the situation and keep going until it settles down. The error I think here is thinking something can’t have flaws and issues, even more if it’s not behind a corporations. And no one wants corporations.

  • Iteria@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    42
    arrow-down
    1
    ·
    2 years ago

    Education probably. Back in the day people didn’t have any problem understanding that different forums had different capabilities. When MMOs were in full swing, people didn’t have problems understanding what being on thr popular server during peak hours meant.

    Everyone has just gotten too used to centralization with a lot of money behind it. Eventually people will adjust their expectations. Even if Meta’s fediverse attempt takes off, there are always going to be niche communities that exist outside of those spheres, so if people want that, they’ll have to move.

    The point of the fediverse is having a choice. Some people are going to chose megacorp of the week’s offering and that’s okay as long a little pockets exist for when people get mad at the megacorp. Also federation leaves space for multiple dominant platforms in a way the current system doesn’t.

    In short, eventually some instances are going to be bankrolled either through a robust crowdcourcing effort or through being a company. That’s okay. The purpose of the fediverse is to allow for smaller niche ideas to be able to breathe without having to adhere to one group’s ideals. “If you don’t like it, make your own” is a fair statement now

  • SlovenianSocket@lemmy.ca
    link
    fedilink
    English
    arrow-up
    36
    ·
    edit-2
    2 years ago

    As far as I’m aware lemmy does not support load balancing or high availability as it currently stands. But development is still in its infancy and I’m sure that’s a top priority

    • Freeman@lemmy.pub
      link
      fedilink
      English
      arrow-up
      14
      ·
      2 years ago

      I mean it can….it’s just very DB heavy. It would be on an admin to scale up and scale out a single instance witb multiple dbs, replication etc.

      It would be nice to be able to assign dbs to a task (ie: one for federation updates, one for local community posts, one to service web requests. There may be a way to do that already but I’m not aware, it may need to be in code.

      Also syncing/federation across instances seems to be a mixed bag. And my instance will sometimes waste threads trying to sync with instances they have come and gone. As a result some communities id love to see updates on don’t come through.

      Ideally they figure a way to continue to optimize federation and allow smaller instances to just pick up the load.

      Mine is open, but I’m not getting any registration requests. I’m not upset about it but their main join page still seems to optimize for larger instances. It would make more sense to optimize for smaller ones to better distribute load. And focus dev work on better l/smoother syncing between federated instances.

      Some locking down is a concern. I would love to see a lemmy of trust group if that came to pass. Where you can join the group and federate. My biggest concern with open federation is the legal risk of things like CSAM or CP getting synced onto your instances even if you have the nsfw box unchecked.

    • Max-P@lemmy.max-p.me
      link
      fedilink
      English
      arrow-up
      5
      ·
      2 years ago

      I don’t see any reasons why you couldn’t run more copies of the backend and frontend, as long as it uses the database properly. It should scale horizontally decently for a while.

      At work I have clusters that runs 40-50 application servers all going to one database and handles millions of requests daily, on a pretty inefficient PHP application. Lemmy being in Rust, it can handle a lot of traffic.

      Given the frontend is in nodejs, I suspect we’ll need to scale up the frontend first, which should be no problem at all, just many copies of the frontend to fewer copies of the backend to fewer copies of the database. Maybe slap Cloudflare in front at some point.

      It will probably get costly to run before it becomes hard technically to scale up.

      • notavote@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 years ago

        Additionally, federation messages probably can be easily separated to different server, and is being made much more efficient right now.

    • ollie@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 years ago

      I don’t think it officially supports it but it does work! Lemmy.world is currently running on multiple containers load balanced by nginx. look at u/ruud latest post about it

    • AggressivelyPassive@feddit.de
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 years ago

      It’s not only about scaling a single instance, but scaling the fediverse.

      Currently, each instance sends all events to all federated instances. That means, essentially each instance needs to store and process a significant part of the entire fediverse. That’s insane and has to be addressed.

  • roadrunner_ex@lemmy.ca
    link
    fedilink
    English
    arrow-up
    25
    ·
    2 years ago

    It’s a challenge, for sure. It is known that there are some inefficiencies in the codebase, which are actively being worked on. But besides that, it’s tricky to know where bottlenecks are until the user influx happens, particularly with the novel federation architecture. Maybe it’s impossible to scale, maybe not, but we only now are seeing a testable use case. I would expect optimization work to start bearing fruit, but these thing take time.

  • Netto Hikari@social.fossware.space
    link
    fedilink
    English
    arrow-up
    17
    ·
    2 years ago

    Well, I run an instance, too. It’s not big at all, but I was thinking about the issue of scaling, too. You can only scale up a single server so much…

    But on the other hand, Lemmy is still young. We’ll find solutions to that problem.

    Also, interesting article. I only took a glance at it, but having only two tables kind of suggests that Reddit is using a relational database. So, if they’re not “normalizing” everything, why not use a completely different paradigm, like what MogoDB etc. has?

    • Irisos@lemmy.umainfo.live
      link
      fedilink
      English
      arrow-up
      7
      ·
      edit-2
      2 years ago

      The database isn’t really the problem in the current state of things. The server is because:

      • Until 0.18 there was no caching (for the UI) and the poorly implemented websockets
      • The developers have admited that they aren’t proficient in SQL, in which case, why not using an ORM instead? Sure, they aren’t perfect but they will do better than the average developer at scale.
      • There is no queue system for activityPub requests
      • Because there is no queue, user requests and federation have the same priority when it shouldn’t and one can bottleneck the other
      • Live inserts are used meaning that regardless of the DB used, performance is going to be killed since inserting data 1 at a time several times a second is a major waste of resource

      Tl;dr: It’s trying to do everything and not that well. So users suffer because they have to share resources with non-UI related tasks.

      The database suffer because it has to do an insert of 1 object X 50 times in a second when it could do it once for all 50 items.

      Federation suffers because you can’t offload it to a seperate machine farm whose job will be to receive and send ActivityPub requests and send/read data from the correct queues to do so.

      • BitOneZero @ .world@lemmy.world
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        1
        ·
        2 years ago

        Federation also does a lot of live HTTP connects to other peers. It looks up users for messages. The whole design is very resource intensive, one single vote, comment, post at a time. There is also a lot of boilerplate JSON overhead in sending something as simple as a single vote.

      • Netto Hikari@social.fossware.space
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 years ago

        Thanks for the breakdown. I was aware of most of the issues with Lemmy. I noticed the weird layout of the database, so yeah…

        My database question however was in regards to Reddit, not Lemmy. Why use a relational database (“tables”) if you’re going to disregard most of the things a relational database does anyway… The “everything is a thing” line was why I questioned myself why Reddit wouldn’t use something like MongoDB instead.

    • R1x38rexrper
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      2 years ago

      I see lemmy comes as a docker image? Do you run the DB and backend on the same box?

  • marsara9@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    ·
    2 years ago

    Lemmy is entirely open source, so you can see what their architecture looks like, etc… here: https://github.com/LemmyNet/lemmy.

    Rate limits, as I understand them from the code, should only apply on a per-IP basis. So you should only be seeing rate limit errors if:

    • your behind a CGNAT and multiple people who use your ISP are using Lemmy
    • you’re sending A LOT of requests to your instance yourself
    • the admin of your instance has significantly lowered the rate limits (viewable here: /api/v3/site)
    • radix@lemm.ee
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      2 years ago

      I’m not an expert, but I thought the issue was generally that big instances like lemmy.world were getting overloaded on the server side, not that they were enforcing a manufactured rate limit on individual IPs.

      Also, someone else mentioned that on the fediverse even simple things like an upvote are slower and require more work here than in centralized platforms because they must be sent to all the instances that are indexing that user/community. As I understand, that’s inherent to the fediverse, a bug not a feature, designed for redundancy and resilience.

      Again uninformed, but Lemmy seems like it should scale fine. Bigger instances will monetize, driving prospective users to smaller instances, and then rate limiting and server lag won’t be so bad anymore.

      • marsara9@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 years ago

        I’m not an expert, but I thought the issue was generally that big instances like lemmy.world were getting overloaded on the server side, not that they were enforcing a manufactured rate limit on individual IPs.

        From what I can see it’s both. lemmy.world and others are getting overloaded, but there is an inherit built-in rate-limit in the code itself. You can see what those limits are via the api/v3/site. Now in theory if you’re actually getting rate-limited you should be seeing HTTP 429 responses from the server. If the server is just overloaded, you’ll get a 5xx response, the request will just timeout or at best you’ll still get a response but after a significant delay (what most people are seeing).

        Also, someone else mentioned that on the fediverse even simple things like an upvote are slower and require more work here than in centralized platforms because they must be sent to all the instances that are indexing that user/community. As I understand, that’s inherent to the fediverse, a bug not a feature, designed for redundancy and resilience.

        I don’t want to comment on this too much as I’m not an expert here, but here’s how federation / ActivityPub works from what I understand looking at the code:

        Whenever you take any action (or activity) your browser will first send that message to your instance. If your instance then owns the community that message is then propagated out to EVERY linked instance listed here: /instances / api/v3/federated_instances. If your instance doesn’t own the community, that message is forwarded off to the instance that does and they sent it out to EVERYONE on their federated instances list. As you can see this creates A LOT of network traffic.

        This posing an interesting problem… the number of ActivityPub messages goes up as the number of instances increase. But at the same time as more and more users join a single instance that require that that instance send more and more traffic to individual user’s browsers as they view and respond to posts. So the problem here is trying to find a good balance. And to top it off, the default behavior of most users is going to be to join the largest instances, making that instance incur more and more traffic to view content.

        Again uniformed, but Lemmy seems like it should scale fine. Bigger instances will monetize, driving prospective users to smaller instances, and then rate limiting and server lag won’t be so bad anymore.

        Will it though? How would an individual instance monetize? They would have to use donations. If an instance tries to add Ads, users will leave to an instance that doesn’t, making it so that they don’t get any income. They could charge a subscription fee, but again users would just leave and the admins get nothing.

        The ideal configuration of the fediverse as I see it, is if we had two types of servers 1) content servers that only hosted communities but didn’t have any real number of users, and 2) user servers that have no communities but most of the users. This way the number of API requests between instances is rather limited. When you end up with a server that has both most of the content and the userbase, the workload of that server appears to grow exponentially instead of linearly as the number of new instances rises.

  • Iron Lynx@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    2 years ago

    My expectation, or at least hope, is that Lemmy will grow horizontally, i.e. more instances for more specialised content, instead of vertically, i.e. more communities in singular, larger instances. Since it’s all federated, you can get to stuff in other instances.

    I just had an idea. Let’s compare reddit and lemmy as land use metaphors.

    Reddit is like one monolithic megacity. It’s full of communites, some big, encompassing entire neighbourhoods, and others smaller, having one street, one block, maybe even just one building.

    Lemmy is like a country, with every instance a city. Some cities are big and varied, others are smaller and specialised, like ones dedicated entirely to fishing or aviation or being German. And you can choose a city to settle in and move between cities for your content. Some cities will be more open to sharing content with residents of other cities, and others will put up bigger restrictions. There are jokes about parts of the userbase on 4chan or Tumblr forming their own subcommunities, and the fediverse allows this in a very material way.

    My expectation is that more cities may emerge as people develop more specialised communities. And since there are many cities, there is some resilience in the system. If an instance goes down, you’ve lost one instance. Out of christ knows how many. Chances are some of its content is duplicated across other instances, so nothing of value is lost. Meanwhile, if (/when) Reddit goes down, all of Reddit is gone.

    In short, I hope lemmy develops more, smaller, specialised instances over time. Reddit allowed very niche insterests to have a corner, and despite that, I think the fediverse is more suited to allow for that than a centralised service.

  • treadful@lemmy.zip
    link
    fedilink
    English
    arrow-up
    10
    ·
    2 years ago

    It’ll scale horizontally, like E-mail. Lots of different servers operated by various entities. There may be some big players, like Outlook or Gmail, but overall there could be thousands of instances.

    • hglman
      link
      fedilink
      English
      arrow-up
      4
      ·
      2 years ago

      Yes email is the right way to see this. It is more complex bc its not push only but that isnt a requirement.

    • hglman
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 years ago

      Yes email is the right way to see this. It is more complex bc its not push only but that isnt a requirement.

    • hglman
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      Yes email is the right way to see this. It is more complex bc its not push only but that isnt a requirement.

    • hglman
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 years ago

      Yes email is the right way to see this. It is more complex bc its not push only but that isnt a requirement.

  • webghost0101@lemmy.fmhy.ml
    link
    fedilink
    English
    arrow-up
    9
    ·
    2 years ago

    The way i see it the best way forward would be each community runs there own instance and what we now call communities should become subtopics of that community.

    So for example. Asklemmy could be an instance and its members are all people who believe in the value of that instance and want to be involved in sustaining it.

    Explainlikeiam5 could be a subcommunities of this instance because its philosophy is largely the same. If asklemmy has plenty of scientist members they could open a askscience subcommunity too.

    The majority of user traffic would all come from other smaller homerun instances.

    Big instances that try to be everything at once are a side effect of the massive growth we are experiencing, they work now but will slowly become more centralized and are therefore doomed to fail (in my opinion)

    To recalculate. How can we help Lemmy grow? By being proactive users that maintain something small we chose to care about.

  • LostRedditor
    link
    fedilink
    English
    arrow-up
    5
    ·
    edit-2
    2 years ago

    I love Lemmy but your question is legit. I just signed up with lemmy.world because lemmy.ml is slow/not responding.

    Before making a post in lemmy.world guess what? lemmy.world isn’t responding. I know they have scheduled maintenance at 9 CET but it was 20 minutes before that.

  • Greg Clarke@lemmy.ca
    link
    fedilink
    English
    arrow-up
    5
    ·
    2 years ago

    Pretty much everything but the database can be infinitely scaled. The database can be somewhat scaled by adding multiple read-only instances. But you’re always going to reach a limit where throwing more resources at the write database gives diminishing returns. At that stage you need to shard to increase performance which can be done but it’s a similar architecture to having multiple federated instances, so why not just spin up a new instance instead of growing an old one? Another way to lighten the load on instances is with efficient used of relays.

  • neardeaf
    link
    fedilink
    English
    arrow-up
    5
    ·
    2 years ago

    As someone who self hosts most things, would it be beneficial as a whole if I just ran a lemmy instance that was locked down to just me as the user? Then I would have a constantly responsive backend to access lemmy and federate with all other instances for full access for myself?