• Adalast@lemmy.world
    link
    fedilink
    English
    arrow-up
    12
    arrow-down
    2
    ·
    7 months ago

    Can we take a minute and stop to assess where Adobe is obtaining its training data? Everyone is all up in arms about the OpenAI devs scraping DA and such, but Adobe is 100% training on the entirety of Behance and the Adobe Cloud. Things that are not public, our personal files that we never intended others to be seen. Our private albums of our children, or our wives/husbands/partners, or parts of NDA restricted projects that are stored in Adobe Cloud automatically that are supposedly not in violation of our NDAs.

    Where are the pitchforks? Where is the outrage? This is 1000x worse than some desperate AI engineer staring at a publicly visible and available training set that is already tagged and described in detail that was begging to be used. People lost their shit over that one. Why does Adobe get a pass?

    • Echo Dot@feddit.uk
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 months ago

      Aren’t they training it exclusively from their own data sets which presumably they already own the licenses for?

      • Adalast@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        ·
        7 months ago

        Their Creative Cloud license

        That little “derivative works” bit in the middle gives them license to use the files stored in Creative Cloud to train AIs. So yes, they are using their data sets that they have license for. It just happens to be our data that they took the license on and we paid them to do it.

      • Adalast@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        7 months ago

        That’s fun, glad to see they are paying people now. I didn’t see in there when in the multi-years long process it takes to develop tool-sets and train checkpoints they paid for the rights to create derivative works. The article is dated a few days ago and it is present tense. They are NOW paying. The AI is trained. The tool is built. It takes tens of thousands of images to train a generative model from scratch, I would expect decades of footage for a video model. So if the model is trained, and them paying is new…?

        Also, they don’t have to ask, or pay… They already have the rights for all content stored in Creative Cloud (EULA Link). Adobe Creative Cloud EULA

        Legally, an AI training is a “derivative work”, so I would need a letter from the lead engineers on the AI dev team at Adobe, signed by every dev who has worked on it, stating that they only used paid training material at every stage of development of the tools, disseminated separately from any official Adobe channel before I would believe that the greedy gaping maw that is Adobe did not just use the millions of images and thousands of years of footage they have legal right to use that THEY are actually PAID for. They know they can pay now because it is a drop in the bucket compared to the Creative Cloud fees and is great PR and an even better smokescreen. There is precisely 0 chance they are going to receive enough good, usable footage through this program to train an AI from scratch.