AI companies are violating a basic social contract of the web and ignoring robots.txt (www.theverge.com)
Posted by Andy Reid@lemmy.world to Technology@lemmy.world · English · 9 months ago · 198 comments
Cross-posted to: technology@midwest.social, technology@beehaw.org, wolnyinternet@szmer.info, technology@lemmy.zip
Ascend910 · 9 months ago
This is a very interesting read. It is very rare for people on the internet to agree to follow one thing without being forced.
Echo Dot@feddit.uk · 9 months ago
Loads of crawlers don’t follow it; I’m not quite sure why AI companies not following it is anything special. Really it’s just to stop Google indexing random internal pages that mess with your SEO. It doesn’t even work reliably for all search providers.
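For anyone wondering what "following it" means in practice, here is a minimal sketch of the check a polite crawler would run before fetching a page, using Python's standard `urllib.robotparser`. The robots.txt content, the bot names (`ExampleAIBot`, `SomeSearchBot`), the `example.com` URLs, and the `is_allowed` helper are all made up for illustration. Note that nothing enforces the result: a crawler that never runs this check can still fetch the page, which is the point being made above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, invented for this example:
# one named bot is blocked everywhere, everyone else only from /internal/.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /internal/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url.

    This only reports what the file asks for; it is up to the
    crawler to honour the answer.
    """
    parser = RobotFileParser()
    # parse() takes the file's lines directly, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent, url in [
        ("ExampleAIBot", "https://example.com/blog/post"),
        ("SomeSearchBot", "https://example.com/blog/post"),
        ("SomeSearchBot", "https://example.com/internal/report"),
    ]:
        print(agent, url, is_allowed(agent, url))
```

A real crawler would more likely call `set_url()` and `read()` on the parser to download the site's actual robots.txt instead of parsing a hard-coded string.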
General_Effort@lemmy.world · 9 months ago
The Internet Archive does not make a useful villain and it doesn’t have money, anyway. There’s no reason to fight that battle and it’s harder to win.