Hi everybody!! I have the archive of everything ever posted on r/GenZedong, and I would like to eventually get it online for when the subreddit eventually gets banned (inevitable lol).

would people use it? do people want it? what does everybody think?

uwu

(actual photo of me at the genzedong archive)

  • Arsen6331 ☭
    link
    fedilink
    12 years ago

    you could compress the whole thing as a 7Z file, using 7-Zip or PeaZip, to make it compact.

    Zstandard for speed or Brotli for compression ratio would probably work better.

    • @TheAnonymouseJoker@lemmygrad.ml
      link
      fedilink
      1
      edit-2
      2 years ago

      Do Zstandard and Brotli have higher compression than 7Z LZMA2, or FreeARC’s ARC format? The latter 2 top efficiency charts, from my archival compression knowledge of the past 10-12 years. Once you encode/package anything, the bandwidth and storage savings are harvested forever.

      • Arsen6331 ☭
        link
        fedilink
        1
        edit-2
        2 years ago

        I did a few tests. I tried compressing a config file with a bunch of algorithms at their highest compression levels. This is what I got:

        2880	traefik.nomad
        1088	traefik.nomad.gz
        1078	traefik.nomad.zst
        1100	traefik.nomad.xz
        1219	traefik.nomad.7z
        918	traefik.nomad.brotli
        

        traefik.nomad is the original. As you can see, Zstandard and Brotli have the best compression ratios. Zstandard is also insanely fast, capable of around 500 MB/sec/core.

        This is not the only time I’ve tested this. I’ve done it with videos, images, random text files, documents, etc., and Brotli always wins in compression ratio, while Zstandard always wins in speed.

        • @TheAnonymouseJoker@lemmygrad.ml
          link
          fedilink
          1
          edit-2
          2 years ago

          Configuration text files are not the only type of files. You could use PPMd in 7-Zip for it. You need to use a variety of files to benchmark. Which and what were the versions of the compression binaries you used? Did you try FreeARC on it?

          I saw this: https://peazip.github.io/fast-compression-benchmark-brotli-zstandard.html

          https://patrickgawron.com/t3/2018/01/30/multi-threaded-7-zip-with-zstandard-brotli-lz4-lz5-and-lizard-compression-support/

          These do not have FreeARC which has even more compression but at little extra time cost. After this, we have PAQ and ZPAQ at various levels, which are impractical for both compression and decompression time efficiency.

          • Arsen6331 ☭
            link
            fedilink
            12 years ago

            Configuration text files are not the only type of files.

            I know. As I’ve mentioned, I have used many types of files in the past, and the general trend is that Brotli has the best compression ratios while Zstandard has the best speeds. I just used a config file as it is very compressible.

            Which and what were the versions of the compression binaries you used?

            • gzip 1.12
            • zstd 1.5.2
            • xz 5.2.6 (liblzma 5.2.6)
            • p7zip 17.04
            • brotli 1.0.9

            Running on Arch Linux

            Did you try FreeARC on it?

            I did not try FreeARC as it is abandoned and I can’t seem to figure out how to use its command-line utility.

            Also, worth mentioning, is that 7zip and FreeARC are both archiving formats that use modified versions of existing compression algorithms, so I wouldn’t really compare them to the other algorithms on the list.

            • FreeARC has a GUI for Windows that is runnable under Wine with no performance penalty. You can learn commandline if you wish to avoid Wine usage.

              p7zip 17.04 is very old now. There is 7-Zip 22.01 available since 3 months. Although it should not affect the benchmarking too much, it is something to note. The binaries of 7-Zip 22.01 should be available in PeaZip’s newest version as well.

              7Z and ARC are not merely modified versions of existing algorithms, but are traditional compression formats that are geared to be more flexible than Google and Facebook created TAR based counterparts here. This is not a correct way to dismiss comparison, when some are superior in application than others.

              Non-traditional compressors are made as an attempt to fine tune and allow Tar-Gzip or Tar-Bzip2 as distribution formats, and to monopolise the archival compression space. They fail at it because of immaturity and worse optimisations.

              You can see in the PeaZip benchmark article above how 7Z is far superior even with just default compression settings. This simply changed to unbeatable levels in terms of ratio with higher compression levels like Maximum or Ultra.

              Also, abandonment of FreeARC does not mean it is unsuitable for benchmarking or real world usage.