cantankerous_cashew@lemmy.world to Technology@lemmy.worldEnglish · 1 day agoMeta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Revealwww.wired.comexternal-linkmessage-square23fedilinkarrow-up1309arrow-down15cross-posted to: technology@lemmy.worldwired@rss.ponder.cat
arrow-up1304arrow-down1external-linkMeta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Revealwww.wired.comcantankerous_cashew@lemmy.world to Technology@lemmy.worldEnglish · 1 day agomessage-square23fedilinkcross-posted to: technology@lemmy.worldwired@rss.ponder.cat
minus-squareCriticalMiss@lemmy.worldlinkfedilinkEnglisharrow-up14·24 hours agoEarlier reports suggested they trained it on books from Bibliotik. What changed?
minus-squarehalcyoncmdr@lemmy.worldlinkfedilinkEnglisharrow-up24·23 hours agoProbably just both honestly.
minus-squareBetaDoggo_@lemmy.worldlinkfedilinkEnglisharrow-up3·17 hours agoThe llama-1 paper acknowledged the use of the books dataset, libgen isn’t mentioned in any of the papers so this is new info.
Earlier reports suggested they trained it on books from Bibliotik.
What changed?
Probably just both honestly.
In for a penny and for a pound.
The llama-1 paper acknowledged the use of the books dataset, libgen isn’t mentioned in any of the papers so this is new info.