Did you even read the article? It specifically traps only scrapers that don’t respect robots.txt and puts them in an endless maze of garbage information.
If you enter a site that clearly warns you “malware ahead”, that’s on you.
Yes, and I am arguing that in terms of volume that’s almost nil and not even bothering the fish. If you feed it random words, it won’t be able to learn anything from them, but that won’t make it any worse either. It just wastes resources on useless tokens, which I think defeats the purpose.