https://archive.is/

Attention Required

Why do I have to complete a CAPTCHA? Completing the CAPTCHA proves you are a human and gives you temporary access to the web property. What can I do to prevent this in the future? If you are on a personal connection, like at home, you can run an anti-virus scan on your device to make sure it is not infected with malware.

If you are at an office or shared network, you can ask the network administrator to run a scan across the network looking for misconfigured or infected devices.

When trying to archive anything, or it looks to be even just visiting the site, if you block Google scripts, archive.is blocks you without solving reCaptcha. This is not good for Archival, I can’t believe anyone even thought this was a good idea. Nevermind the fact that the archives are centralized, now it’s blocked behind Google reCaptcha creating both privacy and centralization concerns.

    • @gravityOP
      link
      34 years ago

      I’m not against keeping archive.is as an option, but I do think we need a better option for archival. IPFS may be a step in the right direction, although I have little knowledge with it.

      • @jsgohac
        link
        24 years ago

        thats an ipfs use case I could finally see

  • @ajz
    link
    2
    edit-2
    2 years ago

    deleted by creator

  • @ajz
    link
    1
    edit-2
    2 years ago

    deleted by creator

      • @jsgohac
        link
        14 years ago

        that link is about startpage, no?

          • @jsgohac
            link
            34 years ago

            On my setup (normal wireguard vpn), I get captcha on archive .is though not startpage. I realize they don’t share one global blacklist, but my experience with archive .is has been overwhelmingly negative.

              • @jsgohac
                link
                2
                edit-2
                4 years ago

                its been a while since I spent any time looking at how catcha works, but it used to be the case that website operators could configure it with varying levels of strictness. also not sure if cloudflare integrates with google or runs its own shield. either way, I visit many sites with this setup and archive is is just about the only one that throws up the blocker. its probably a testament to google knowing very well my fingerprint :(

                Edit: ran into some details on https://webmasters.googleblog.com/2018/10/introducing-recaptcha-v3-new-way-to.html

                Fighting Bots Your Way Another big benefit that you’ll get from reCAPTCHA v3 is the flexibility to prevent spam and abuse in the way that best fits your website. Previously, the reCAPTCHA system mostly decided when and what CAPTCHAs to serve to users, leaving you with limited influence over your website’s user experience. Now, reCAPTCHA v3 will provide you with a score that tells you how suspicious an interaction is. There are three potential ways you can use the score. First, you can set a threshold that determines when a user is let through …

  • @geopoliticssuck
    link
    1
    edit-2
    4 years ago

    I personally just use Wayback Machine, even if it’s slow. But there’s also ArchiveBox which is open source.