misk@sopuli.xyz to Technology@lemmy.worldEnglish · edit-21 天前'Meta Torrented over 81 TB of Data Through Anna's Archive, Despite Few Seeders'torrentfreak.comexternal-linkmessage-square83fedilinkarrow-up1608arrow-down11cross-posted to: audiobookbay@lemmy.dbzer0.compiracy@lemmy.dbzer0.compiracy@lemmy.dbzer0.com
arrow-up1607arrow-down1external-link'Meta Torrented over 81 TB of Data Through Anna's Archive, Despite Few Seeders'torrentfreak.commisk@sopuli.xyz to Technology@lemmy.worldEnglish · edit-21 天前message-square83fedilinkcross-posted to: audiobookbay@lemmy.dbzer0.compiracy@lemmy.dbzer0.compiracy@lemmy.dbzer0.com
minus-squareLainTrain@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up1·edit-28 小时前Mistral? Deepseek? Not LLM but also SD which uses a very popular free dataset.
minus-squareFooBarrington@lemmy.worldlinkfedilinkEnglisharrow-up2·7 小时前Can I freely download all the training data for any of those? I was under the impression they were all trained on non-licensed and copyrighted data.
Mistral? Deepseek?
Not LLM but also SD which uses a very popular free dataset.
Can I freely download all the training data for any of those? I was under the impression they were all trained on non-licensed and copyrighted data.