I was just reading posts on there a few minutes ago and it is now banned. That sub was pretty much just educational, this shit is getting ridiculous.

  • DongFangHong@lemmygrad.ml
    link
    fedilink
    arrow-up
    62
    ·
    3 years ago

    I backed up all submissions on /r/GenZhou last week because I was afraid this would happen.

    If you want to download the raw json, see /c/GenZhouArchive

        • PorkrollPosadist@lemmygrad.ml
          link
          fedilink
          arrow-up
          9
          ·
          edit-2
          3 years ago

          If anyone wants to take a stab at this, I recommend setting up a local development instance (following the instructions provided in the lemmy docs) to test against. You don’t need to learn Rust to do this, but you will need to have Rust, PostgreSQL, Node and Yarn at your disposal. Alternately, you can use the docker method, but I find it much easier to introspect what the hell is going on with the database when it isn’t inside of a container.

          After starting up the dev instance, you can use something like pgAdmin to poke around at your leisure and figure out the schema. You can also look through the Diesel migrations (written in SQL, used to initialize / upgrade / downgrade the database), however these are split up into dozens of individual updates chronologically and it is easy to overlook important bits of context (i.e. a table is created in one migration, a column is removed in another, two more columns are added in a third one, etc).

          Edit: Speaking for myself, having no affiliation with Lemmygrad, I think creating dummy users would be excessive and complicated. It would probably double the site’s user count just to import the archive. What might be better is to use a single account named “GenZhouArchive” or something and add a string to the start of each comment indicating the original reddit username. Along the same lines, it might be a good idea to do the same with post/comment scores. Applying the Reddit comment scores to Lemmy posts/comments will flood the top sort heavily.

          • KiaKaha@lemmygrad.ml
            link
            fedilink
            arrow-up
            2
            ·
            3 years ago

            Another approach that hits some of those concerns is to create a whole new instance of @archive.ml, and import everything there as read-only. That way you keep all the architecture, and it’s accessible via Lemmy, but it doesn’t flood the database and stats of this instance.

        • comfy
          link
          fedilink
          arrow-up
          5
          ·
          3 years ago

          It would probably be best to ask the developers to be most certain. If you already have Matrix chat, you can ask in #lemmy:matrix.org or the dev chat channel in their subject line.

            • comfy
              link
              fedilink
              arrow-up
              3
              ·
              3 years ago

              You’re right, I was tired and commented before reading the whole thing. I just saw the talk of schemas and thought I understood.

        • Muad'Dibber@lemmygrad.ml
          link
          fedilink
          arrow-up
          4
          ·
          3 years ago

          Are you thinking about importing all of the genzhou posts and comments into the lemmygrad database as posts/comments on genzhouarchive?

          Yep, precisely. If you know SQL you could do the create / batch import statements with that schema.

          1

          They are likely pretty close, but if you pull down lemmy and run it locally, you could inspect the postgres tables directly. Since migrations might have some column changes since those creates.

          2

          Probably just a single user, and maybe in the comment body you could have the top line be their real reddit username.

    • TheConquestOfBed
      link
      fedilink
      arrow-up
      6
      ·
      edit-2
      3 years ago

      This is great! But for some reason I can’t view that comm from my Lemmy.ml account. :(

      (It is visible in-browser when logged out though?)