I know that Lemmy is open source and it can only get better from here on out, but I do wonder if any experts can weigh in whether the foundation is well written? Or are we building on top of 4 years worth of tech debt?

  • BURN@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    1 year ago

    It’s decent, but it isn’t scalable, at least not yet.

    Right now the entire Lemmy backend is one big “monolith”. One app does everything from logins and signups to posting and commenting. This makes it a little harder to scale, and I wouldn’t be surprised to see it split out into multiple micro services sooner rather than later so some of the bigger instances can scale better.

    I’d love to know where the higher level dev stuff is being discussed and if they’ve made a decision on why or why not microservices.

    • ShittyKopper [they/them]@lemmy.w.on-t.work
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      1 year ago

      Microservices aren’t a silver bullet. There’s likely quite a lot that can be done until we need to split some parts out, and once that happens I expect that federation would be the thing to split out as that’s one of the more “active” parts of the app compared to logins and whatnot.

      • BURN@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        Definitely not a silver bullet, but should stop the app from locking up when one thing gets overloaded. I’m sure they have their reasons for how it’s designed now and I’m probably missing something that would explain it all.

        I’m still not familiar enough with how federation works to speak to how easy that would be. Unfortunately this has happened all as I’ve started moving and I haven’t gotten a chance to dive into code like id want to.

        • Sir_Simon_Spamalot@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          It’s also not the only solution for high-availability system. Multiple monoliths with load-balancing can be used as well.

          Also, a lot of people are self-hosting. In this case, microservice won’t give them any scaling benefit.

          • boeman@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 year ago

            The problem with scaling monoliths is you are scaling everything, including the pieces that have lower usage. The huge benefit you get from going to micoservices is you only have to scale the pieces that need to be scaled. This allow for horizontal scaling to use less compute resources. It also allow for these compute resources to be spread out as well.

            A lot of the headaches can be removed by having an effective CI/CD strategy that is completely reusable with minimal effort.

            The last headache would be observability. There you’re stuck either living with the nightmare of firefighting problems with 100 services in possibly 10 locations, rolling your own platform using FOSS tools or spending a whole lot of money on something like honeycomb, datadog or new relic.

            But I’m an SRE, I live my life for scalablability and DevOps processes. I know I’m biased.

            • Sir_Simon_Spamalot@lemmy.world
              link
              fedilink
              English
              arrow-up
              0
              ·
              1 year ago

              I’m a software engineer myself, but not familiar with your field. How would your practice be applied to self-hosting? I’m assuming a bunch of people with their home servers wouldn’t want to just run OpenShift.

              • boeman@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 year ago

                Personally, I wouldn’t touch OpenShift. As someone that has a kubernetes cluster hosted at my house on a mixture of RPis, a nas and in VMs, I’m not one to to say what anyone else would do :).

                But, that can be overcome, it’s all about designing you application for multiple different installs. You don’t have to have all your services running fully separately. You can containerize each service and deploy to an orchastration engine such as kubernetes or docker swarm, or you can have the multiple endpoints on a single machine with an install package that keeps them together. It’s all about architecting toward resiliency, not toward a pattern for no other reason.

                Also, Google has some very good books on SRE for free.

    • sosodev@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      1 year ago

      There’s no reason that a monolith can’t scale. In fact you scale a monolith the same way you scale micro services.

      The real reason to use micro services is because you can have individual teams own a small set of services. Lemmy isn’t built by a huge corporation though so that doesn’t really make sense.

      • myersguy@lemmy.simpl.website
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        I disagree that it being a monolith is immediately a problem, but also

        In fact you scale a monolith the same way you scale micro services.

        This is just not true. With microservices, it is easy to scale out individual services to multiple instances as demand requires them. Hosting a fleet of entire Lemmy instances is far more expensive than just small slices of it that may require the additional processing power.

          • myersguy@lemmy.simpl.website
            link
            fedilink
            arrow-up
            1
            ·
            1 year ago

            Well, I’m going to start by repeating that I don’t necessarily agree that it being monolithic is necessarily a problem right now.

            The immediate thought in my mind would be all of the federation logic. That’s where all of the instances seem to be lagging behind, and it seems the common fix is “just increase the workers to one billion”. Apparently that does something meaningful, but the developer in me wants to know how a few cores can put so many workers to use.

            Spinning federation off into a microservice means you could deploy it on something like Cloud Run or AWS ECS, and have it autoscale as the workload demands it. Seems like a pretty prime candidate to me.

          • BURN@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            edit-2
            1 year ago

            I’d split it out into a few systems

            • Signup/Login/Account management
            • Posting/Commenting/Voting
            • Moderation Controls
            • DB Readers and Writers in different services
            • Community management (may get lumped into moderation controls, but separation of duties may be needed)
            • Edit: Federation is a big one I keep forgetting about

            The ultimate goal, and I don’t know how possible it is with rust, would be to have a way to run those as individual services or as one part of a larger monolith. Smaller instances would be able to run it as one binary, while larger instances like Lemmy.world or Lemmy.ml can run each part independently. That would allow easier scaling on large instances while (hopefully) leaving it just as simple to deploy on a small scale too.

            There’s no reason to split it into 100 different services, but 4-5 mid sized ones might help.

        • boonhet@lemm.ee
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          1 year ago

          Lemmy’s backend is native code, not run in a virtual machine or interpreted from text like most backends (Java and PHP are still so, so popular, as is Python). You’re not going to pay much extra for an extra megabyte or 10 of RAM being used per instance for the extra code sitting idle. It certainly shouldn’t use much processing power when not in use.

          • myersguy@lemmy.simpl.website
            link
            fedilink
            arrow-up
            1
            ·
            1 year ago

            I’d be less concerned with memory (of which Lemmy seems to use very little), and much more concerned with CPU core count. I touched on it in my other comment, but I don’t understand how a few cores is supposed to handle the ridiculous number of federation workers people are setting their instances to.

            • boonhet@lemm.ee
              link
              fedilink
              arrow-up
              0
              ·
              1 year ago

              Does code that exists but isn’t being executed often actually impact CPU usage that much?

              Linux kernel has like 30 million lines of code and you probably compile a fifth of it into your actual kernel binary by default. That’s still several million lines, but it doesn’t use much CPU at all. Rather, eliminating all the excess code keeps your size down so you can load it from disk in 0.0001 seconds instead of 0.0002.

              Now if code that doesn’t need to run often, runs more often because we scaled the monolith horizontally - that IS a problem, but it’s not a problem inherent to the monolith design pattern, but rather a specific instance of bad design.

              • myersguy@lemmy.simpl.website
                link
                fedilink
                arrow-up
                1
                ·
                edit-2
                1 year ago

                Again, my knowledge of the Lemmy codebase is very small, and we could possibly host the monolith in microservices style. The point I am making is this (when it comes to scaling monolith vs microservices):

                If the federation logic were split out, we could configure it to run on super tiny docker instances on Google Cloud or AWS. Any time we needed it to, it would autoscale to handle the traffic. The configuration for these dockers could be super minimal memory, no storage, and multiple weak CPU cores. This would be super affordable while still being able to handle as much traffic from federation as we ask it to. One of the cool things with Google Cloud Run is that it handles load balancing between docker containers for you (just point the federation traffic at the necessary URL)

                IF Lemmy has things like background services, scheduled tasks, etc, this would significantly muddy the water (we would need each service to be able to handle being run on a multitude of instances, or we would need to be able to disable each one instance by instance). And if we just scaled by spinning up more instances of Lemmy, we would also need to ensure that only federation traffic is heading to the weaker instances that we spun up for such purpose, or we would need to ensure that each spun up instance has enough resources to handle federation traffic along with the main application.

                I feel like I need to state once more: I don’t necessarily think Lemmy needs to move to microservices. Only that scaling monolith vs microservices is not necessarily the same.

        • sosodev@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          1 year ago

          You can easily scale a monolith. You typically horizontally replicate any web server (monolith or not) to handle whatever traffic you’re getting. It shouldn’t really matter what type of traffic it is. Plenty of the world’s biggest websites run monoliths in production. You know how people used to say “rails doesn’t scale”? Well they were wrong because Rails monoliths are behind some huge companies like GitHub and Shopify.

          The lemmy backend is also quite lightweight and parallel so it’s cheap and effective to replicate.

          In my professional experience microservices are usually a dumpster fire from both the dev perspective and an ops perspective (I’m a Site Reliability Engineer).

          • myersguy@lemmy.simpl.website
            link
            fedilink
            arrow-up
            1
            ·
            1 year ago

            This isn’t really contradictory to what I said. I only wished to express that the two don’t scale exactly the same (in terms of execution)

        • sosodev@lemmy.world
          link
          fedilink
          arrow-up
          0
          arrow-down
          1
          ·
          1 year ago

          You can easily scale a monolith. You typically horizontally replicate any web server (monolith or not) to handle whatever traffic you’re getting. It shouldn’t really matter what type of traffic it is. Plenty of the world’s biggest websites run monoliths in production. You know how people used to say “rails doesn’t scale”? Well they were wrong because Rails monoliths are behind some huge companies like GitHub and Shopify.

          The lemmy backend is also quite lightweight and parallel so it’s cheap and effective to replicate.

          In my professional experience microservices are usually a dumpster fire from both the dev perspective and an ops perspective (I’m a Site Reliability Engineer).

          • boeman@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            1 year ago

            I can’t say I disagree… Poorly implemented microservice architecture is the bane of my existence. Well implemented, though, and it makes my job so much easier.

            Granted, my SRE team has all public facing production infrastructure built using an IAC process, if something causes too much trouble, it’s easier to quarantine and rebuild the offending node(s), and can be complete in under 10 minutes.

            The biggest problem is far too many developers ignore the best practices and just shift existing code into smaller services. That will never give you either performance or stability benefits. Honestly, it will probably make any issues worse. Microservice architecture is a huge shift in thinking. The services need to be fairly independent of each other to really make any gains. To get to that point will always take a whole lot of work. That being said, there is nothing inherently wrong with some monoliths, but the benefits of splitting out as much of the higher traffic and resource intensive work should never be overlooked.