OpenAI just admitted it can’t identify AI-generated text. That’s bad for the internet and it could be really bad for AI models.

In January, OpenAI launched a system for identifying AI-generated text. This month, the company scrapped it.

  • Womble@lemmy.world
    English
    arrow-up
    7
    ·
    1 year ago

    Your assertion that a future AI detector will be able to detect current LLM output is dubious. If I give you the sentence “Yesterday I went to the shop and bought some milk and eggs.”, there is no way for you or any detection system to tell whether it was AI-generated with any significant degree of certainty. What can be done is statistical analysis of large datasets to see how they “smell”, but saying that around 30% of a dataset is likely LLM-generated does not get you very far in creating a training set.

    I’m not saying that there is no solution to this problem, but blithely waving the problem away by saying future AI will be able to spot old AI is not a serious take.
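
To make the gap between a dataset-level estimate and per-item labels concrete, here is a minimal sketch. It assumes a hypothetical noisy detector whose true-positive and false-positive rates are known; the rates (0.6 and 0.2) and all function names are made up for illustration and do not describe any real detector. The prevalence correction is the standard Rogan-Gladen estimator, swapped in here as one plausible form of the "statistical analysis" mentioned above.

```python
import random


def estimate_prevalence(flag_rate, tpr, fpr):
    """Rogan-Gladen correction: recover the true fraction of positives
    from the raw fraction of items a noisy detector flags."""
    if tpr <= fpr:
        raise ValueError("detector must flag positives more often than negatives")
    est = (flag_rate - fpr) / (tpr - fpr)
    return min(1.0, max(0.0, est))  # clamp to a valid proportion


def simulate_flag_rate(n, true_fraction, tpr, fpr, seed=0):
    """Simulate running the hypothetical detector over n documents and
    return the fraction it flags as LLM-generated."""
    rng = random.Random(seed)
    flagged = 0
    for _ in range(n):
        is_llm = rng.random() < true_fraction
        p_flag = tpr if is_llm else fpr
        if rng.random() < p_flag:
            flagged += 1
    return flagged / n


def precision_of_flag(true_fraction, tpr, fpr):
    """Probability that a single flagged document really is LLM-generated."""
    true_pos = true_fraction * tpr
    false_pos = (1.0 - true_fraction) * fpr
    return true_pos / (true_pos + false_pos)


if __name__ == "__main__":
    # Assumed numbers: 30% of the corpus is LLM text, detector TPR 0.6, FPR 0.2.
    rate = simulate_flag_rate(100_000, 0.30, tpr=0.6, fpr=0.2)
    print("raw flag rate:", round(rate, 3))
    print("estimated LLM fraction:", round(estimate_prevalence(rate, 0.6, 0.2), 3))
    print("chance a flagged item is LLM:", round(precision_of_flag(0.30, 0.6, 0.2), 3))
```

Under these assumed rates the aggregate estimate lands close to the true 30%, yet any individual flagged document is LLM-generated only a little more than half the time, which is why the dataset-level number does not translate into a clean training set.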

    • lily33@lemmy.world
      English
      arrow-up
      2
      arrow-down
      3
      ·
      1 year ago

      If you give me several paragraphs instead of a single sentence, do you still think it’s impossible to tell?

      • steakmeout@lemmy.world
        English
        arrow-up
        1
        ·
        1 year ago

        “If you zoom further out you can definitely tell it’s been shopped because you can see more pixels.”