• someguy7734206@sh.itjust.works
    link
    fedilink
    arrow-up
    25
    ·
    1 year ago

    One thing I’ve started to think about for some reason is the problem of using AI to detect child porn. In order to create such a model, you need actual child porn to train it on, which raises a lot of ethical questions.

    • breadcodes@lemm.ee
      link
      fedilink
      arrow-up
      27
      ·
      1 year ago

      Cloudflare says they trained a model on non-cp first and worked with the government to train on data that no human eyes see.

      It’s concerning there’s just a cache of cp existing on a government server, but it is for identifying and tracking down victims and assailants, so the area could not be more grey. It is the greyest grey that exists. It is more grey than #808080.

      • well, many governments had no issue taking over a cp website and hosting it for montha to come, using it as a honeypot. Still they hosted and distributed cp, possibly to thousands of unknown customers who can redistribute it.

    • XPost3000
      link
      fedilink
      arrow-up
      9
      ·
      1 year ago

      I’m pretty sure those AI models are trained on hashes of the material, not the material directly, so all you need to do is save a hash of the offending material in the database any time that type of material is seized

        • XPost3000
          link
          fedilink
          arrow-up
          4
          ·
          1 year ago

          Nah, flipping the image would completely bypass a simple hash map

          From my very limited understanding it’s some special hash function that’s still irreversible but correlates more closely with the material in question, so an AI trained on those hashes would be able to detect similar images because they’d have similar hashes, I think

    • LeylaaLovee@lemmy.blahaj.zone
      link
      fedilink
      arrow-up
      2
      ·
      edit-2
      1 year ago

      This is a stupid comment trying to hide as philosophical. If your website is based in the US (like 80 percent of the internet is), you are REQUIRED to keep any CSAM uploaded to your website and report it. Otherwise, you’re deleting evidence. So all these websites ALREADY HAVE giant databases of child porn. We learned this when Lemmy was getting overran with CP and DB0 made a tool to find it. This is essentially just using shit any legally operating website would already have around the office, and having a computer handle it instead of a human who could be traumatized or turned on by the material. Are websites better for just keeping a database of CP and doing nothing but reporting it to cops who do nothing? This isn’t even getting into how moderators that look for CP STILL HAVE TO BE TRAINED TO DO IT!

      Yeah, a real fuckin moral quandary there, I bet this is the question that killed Kant.