• CodeInvasion@sh.itjust.works
    10 months ago

    AFAIK, there’s nothing stopping any company from scraping Lemmy either. The whole point of Reddit limiting API usage was so they could make money like this.

    Outside of morals, there is nothing to stop anybody from training on data from Lemmy just like there’s nothing stopping me from using Wikipedia. Most conferences nowadays require a paragraph on ethics in the submission, but I and many of my colleagues would have no qualms saying we scraped our data from open source internet forums and blogs.

    • Leraje@lemmy.blahaj.zone
      10 months ago

      You’re right, anyone can scrape Lemmy. But that’s not the issue (to me anyway) - Reddit have sold user data - user generated content. None of what they’re profiting from was generated or created by them. Are Reddit users who did generate all this content getting a slice of the profits?

      When I post on here I know it’s all open for anyone to access, but that’s true of any space that isn’t a walled garden. I’ve accepted the fact that it’s going to get fed into the hungry maw of some AI behemoth or two.

      What Reddit have done is make money for doing absolutely nothing, based on content others have created, like some sort of technological tapeworm feeding secondhand. And along the way they killed off a lot of tools that users loved, that moderators found made their jobs easier, and that people with a visual disability found vital. And all this so u/spez can live out his mini-Musk fantasies.