archive.is Requires Google Captcha

https://archive.is/

Attention Required

Why do I have to complete a CAPTCHA? Completing the CAPTCHA proves you are a human and gives you temporary access to the web property. What can I do to prevent this in the future? If you are on a personal connection, like at home, you can run an anti-virus scan on your device to make sure it is not infected with malware.

If you are at an office or shared network, you can ask the network administrator to run a scan across the network looking for misconfigured or infected devices.

When trying to archive anything, or it looks to be even just visiting the site, if you block Google scripts, archive.is blocks you without solving reCaptcha. This is not good for Archival, I can’t believe anyone even thought this was a good idea. Nevermind the fact that the archives are centralized, now it’s blocked behind Google reCaptcha creating both privacy and centralization concerns.

@ajz
link
1
edit-2
1Y

If you are using Tor it will show you a reCaptcha. The advantage of using archive.is /vn/ph is that you don’t need an account for it, like is the case with the well-known archive.org (wayback machine).

@Echedenyan
link
41Y

Even without Tor you can see the captcha given a thing commented in other thread.

@jsgohac
link
11Y

that link is about startpage, no?

@Echedenyan
link
1
edit-2
1Y

Yes, but applies to any global captcha service which could blacklists ips from any of the webpages that use it (as it is not just a webpage using it but is widely used (this could also affect to webpages being widely used instead of a decentralized network))

@jsgohac
link
31Y

On my setup (normal wireguard vpn), I get captcha on archive .is though not startpage. I realize they don’t share one global blacklist, but my experience with archive .is has been overwhelmingly negative.

@Echedenyan
link
11Y

yes, startpage use its own one but is a widely used website as it is centralized. however recaptcha is a service shared between different websites with only central point comming from google.

@jsgohac
link
2
edit-2
1Y

its been a while since I spent any time looking at how catcha works, but it used to be the case that website operators could configure it with varying levels of strictness. also not sure if cloudflare integrates with google or runs its own shield. either way, I visit many sites with this setup and archive is is just about the only one that throws up the blocker. its probably a testament to google knowing very well my fingerprint :(

Edit: ran into some details on https://webmasters.googleblog.com/2018/10/introducing-recaptcha-v3-new-way-to.html

Fighting Bots Your Way Another big benefit that you’ll get from reCAPTCHA v3 is the flexibility to prevent spam and abuse in the way that best fits your website. Previously, the reCAPTCHA system mostly decided when and what CAPTCHAs to serve to users, leaving you with limited influence over your website’s user experience. Now, reCAPTCHA v3 will provide you with a score that tells you how suspicious an interaction is. There are three potential ways you can use the score. First, you can set a threshold that determines when a user is let through …

@Echedenyan
link
31Y

reCatpcha could be also forced in some websites as nyaa.si has a good example in certain sections, but when used to prevent abuse and it is released automatically, works like that. Mostly with cloudflare. Cloudflare can use reCaptcha but i think hCaptcha (its own one) can be also chose.

@jsgohac
link
21Y

I guess for my purposes, I only want to know if archive is is part of the surveillance state. In the meantime, I don’t use it.

@geopoliticssuck
link
1
edit-2
1Y

I personally just use Wayback Machine, even if it’s slow. But there’s also ArchiveBox which is open source.

@resynth1943
link
8
edit-2
8M

deleted by creator

@gravity
creator
link
31Y

I’m not against keeping archive.is as an option, but I do think we need a better option for archival. IPFS may be a step in the right direction, although I have little knowledge with it.

@jsgohac
link
21Y

thats an ipfs use case I could finally see

@resynth1943
link
5
edit-2
8M

deleted by creator

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

  • Posting a link to a website containing tracking isn’t great, if contents of the website are behind a paywall maybe copy them into the post
  • Don’t promote proprietary software
  • Try to keep things on topic
  • If you have a question, please try searching for previous discussions, maybe it has already been answered
  • Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
  • Be nice :)

Related communities

much thanks to @gary_host_laptop for the logo design :)

  • 0 users online
  • 25 users / day
  • 73 users / week
  • 208 users / month
  • 612 users / 6 months
  • 3485 subscribers
  • 1884 Posts
  • 8403 Comments
  • Modlog