1. These data are publicly available, I just “rearranged” them in a database I created.
  2. Understand these data as trends, not reality, there is billions of erotics comics/hentais out there and my database only hold 43 893 of them.
  3. I am not a data analyst. These values do not hold a real value and I can have made errors.

I have a LOTS more data I can share like: most popular characters/parodies/tags, etc.

I do NOT encourage any of you to read erotic comics/hentais. I personally don’t but I though these data might be interesting.

  • corvus
    link
    fedilink
    arrow-up
    1
    ·
    8 hours ago

    What do you mean by “average number of pages”? Average over what?

    • Llufollis@sh.itjust.worksOP
      link
      fedilink
      Français
      arrow-up
      1
      ·
      4 hours ago

      Each entry in the database contains a language and a number of pages. I sorted all the entries by language and took the average number of pages for each of them. But it also display major weakness, each language don’t have the same number of entries, some have thousands, others less than a hundred. I should have “normalized” the number of entry for each language and exclude languages which don’t have enough entries.

    • Llufollis@sh.itjust.worksOP
      link
      fedilink
      Français
      arrow-up
      2
      ·
      17 hours ago

      The language hasn’t been defined. I just took the language defined in the metadata of each books without processing them, so the typos and weird stuff are because the website owner or the website it take the books from have different naming convention/made typos (e.g. Japanese, Javanese).

  • fallowseed@lemmy.world
    link
    fedilink
    arrow-up
    4
    ·
    edit-2
    21 hours ago

    “i only look at them for the analysis” ;) i’m curious what it is about the catalan/bulgarian/dutch-- is it because there are simply less releases in those countries so more entry-level stuff, does it have to do with how their language condenses?

    • ImplyingImplications@lemmy.ca
      link
      fedilink
      arrow-up
      1
      ·
      19 hours ago

      It’s probably just that people in those countries prefer their erotic comics to be short and to the point. The chart even has a bar for “speechless” so I don’t think language density has much to do with page count. There’s also a bar for “translated” which seems to show translators pick shorter comics, which is interesting.