For a while now, I've had infrequently occurring arcane cert/SSL issues connecting to archive.ph and its siblings, but trying a couple of links from the article I find I can't get past an endless cycle of "one more step" captcha protection - tried clearing all cookies and revisiting an old url, but to no avail.
Help:Using archive.today - Wikipedia: https://en.wikipedia.org/wiki/Help:Using_archive.today
archive.today - FAQ : https://archive.md/faq
archive.today - wiki : https://wiki.archiveteam.org/index.php/Archive.today
Archive Team wiki : https://wiki.archiveteam.org/
archive.today - Blog : https://blog.archive.today/
Tumblr : https://archive-is.tumblr.com/
Twitter : https://twitter.com/archiveis
archive.today
archive.ph
archive.is
archive.li
archive.vn
archive.fo
archive.md
archiveiya74codqgiixo33q62qlrqtkgmcitqx5u2oeqnmn5bpcbiyd.onion
Launched May 16, 2012; 11 years agoDNS was ISP, not 1.1.1.1, and I get the same behaviour after switching to 8.8.8.8.
captcha is CF
Related: Does Cloudflare’s 1.1.1.1 DNS Block Archive.is? (2019)
HN Discussion (209-comments 2023-08-02) >>36970702
I just snapshot this page for a test : https://archive.is/MUhAP = >>37009598
Edit of formatting for readability.
My woes with the site is that my connection to any of the clearnet domains seem to get black holed, or completely blocked by Cloudflare while using Tor. The onion site works fine for viewing, but to archive pages I need to complete the extremely difficult Cloudflare CAPTCHA.
Do they run their own resolver, or rely on an extant service?
I'm reticent to disclose my current DNS provider,
given that I am able to access archive.is and many are not at this point of time.
Edit
I have just checked 9.9.9.9 and they are serving the correct response now. (incorrect earlier)
Though I am lucky compared to other users, I only need to click a box, and wait ~1.5 secs, no pictures or endless loop.
there was a recent move from the eu to have an eu-centric public resolver which brought up the question if/how the big players address country specific filtering requirements which in turn might have shed some light on the fact that gog/cf didn't care; until now.
archive.is is blackholed in many places.
HN Readers, Commentators and Story Submitters are aware of the FAQ; 'Are paywalls ok?
It's ok to post stories from sites with paywalls that have workarounds. : https://news.ycombinator.com/newsfaq.html
Which means for the Publishers of pay-walled sites to be featured on HN,
which is a prime site for garnering potential paying new prospects and to replace natural attrition of subscribers.
Some Publishers see this as a positive.. free samples, a minor amount of the Publications full output,
much like a paid agent in a supermarket giving out cheese and/or spiced meat on a stick etc, to encourage new users.
Other Publishers see this as mice nibbling at their cheese.
I use archive.is because almost of the Articles submitted to HN are already archived, so it is a simple copy and past.
archive.is does Not require java to read or to archive an Article,
it is fast and the archiving scripts work on most sites.
Generally I enjoy archive.today very much, but it seems to be a labour of love which can go away any moment (despite its apparent resiliency), rather than something for the ages...
Oh and - given the right adlists - may also prevent infecting your machine/network/... with malware...
Not to speak of clients which may not equipped with on-device adblockers, such as TVs etc...
Not sure why they do that. Is it just because it looks decent, or is it poking fun, maybe because of their issue with 1.1.1.1?
I am not sure why Liberapay is qualified as 'obscure'. Their website's "legal" page [0] clearly identifies the organization and its legal representative, while providing contact details. The status of the non-profit organization can be verified in the French government's website [1].
[0]: https://en.liberapay.com/about/legal [1]: https://www.journal-officiel.gouv.fr/pages/associations-deta... - in French
I would expect that only a relatively small proportion of people would have heard of Liberapay, so I think calling them "obscure" is not wrong.
Just highlighting the flaw in your assumption that the anonymous person behind this is male.
That said, I have the same problem. Even hard coding the IP address I resolved through Google doesn't seem to work. I'm guessing their sabotage may have backfired and is causing issues beyond their intentional scope?
Just as I wrote.
One interesting thing I'd like to mention are these tweets[1] by archive.is when he was supposedly questioned for something at the finnish - russian border and as a result he blocked the entire site in Finland, although later he lifted the block. I also couldn't find any information about the "Russia vs. http://archive.org case" he mentions in the tweet.
No.
I do have 40+ years of observing commenters on internet forums assuming unknowns are male by default.
Maybe think about that?
For those aware the juxtaposition twixt { King of Men } | { Woman who created massive scientific paper archive } tends to dominate.
But sure, not everybody reads things the same way.
x=23.137.248.133 # NL
x=41.77.143.21 # GB
x=51.38.69.52 # GB
x=51.79.250.183 # SG
x=79.133.51.130 # DE
x=89.253.237.217 # RU
x=90.156.209.190 # RU
x=90.156.209.190 # RU
x=91.193.43.144 # NL
x=94.140.114.194 # LV
x=130.0.232.208 # UA
x=139.99.171.251 # AU
x=139.99.89.157 # SG
x=178.17.174.208 # MD
x=178.250.243.66 # RU
x=185.101.35.175 # NO
x=185.125.168.154 # NO
x=188.143.233.210 # RU
x=192.124.216.250 # RU
x=192.210.214.166 # US
x=193.148.248.205 # NL
x=193.233.203.196 # MD
x=217.197.116.88 # RU
To test, something like printf 'GET /timemap/example.com HTTP/1.0\r\nhost: archive.is\r\nconnection: close\r\n\r\n'|openssl s_client -connect $x:443 -ign_eof
echo $x archive.is >> /etc/hosts
curl -0A "" https://archive.is/timemap/example.comChalk the snotty comment, and irrational YC worship, up to your basic herd mentality. The author is, I would bet, a typical HN poster.
That's amazing, I never bothered to take a look once I saw that page but I did just now, and you're right. Google reCAPTCHA skinned as Cloudflare, hysteric.
This is why I consider these types of archives important: not so much to bypass paywalls, but to ensure content is still available in a decade, or two decades.
> Github ... account called “volth” ... contributed ... to NixOS
>
Volth maintained NixOS Perl subsystem:
https://github.com/NixOS/nixpkgs/commits/master?after=1c72dc...
>
> The obvious denispetrov.com ... programmer ... a New Yorker ... end of a 25-year career and the blog dries up entirely in 2011, so it doesn’t match the place or time
>
A Perl programmer: http://web.archive.org/web/20050208095206/http://www.denispe...
Archive.is started in 2012, just after retirement, why these do not match?
The nixos link was edited and removed by the author 3 hours after this submission was posted to HN.
Checking who wrote this blog, the About starts with:
> Jani Patokallio was first bitten by the travel bug at the age of 8 months and hasn't managed to shake it yet. Halfway through racking up 650,000 flight miles
sounds like a nice person (next, they'll tell us how much plastic they bought in a lifetime!), but that aspect aside, I'm not seeing any motive for why archive's personalia should need to be dug into...
Also, «Маша» (Masha) and «Мойша» (Moishe/Moshe) are completely different names, and I've never ever seen anyone using the former for the latter. Either the author stretches it a bit too much, or the author knows something that should not be publicly revealed in the manner they chose (and the whole post is just an intimidating leak).
Anyway, if the author(s) have successful illegal business, as implied, they shouldn't have any difficulties in acquiring enough spare identities to burn. As a side note, it's quite ironic that “security” is such an idol today that common people need to go out of their way to evade tracking, while even petty internet criminals buy virtual identities in bulk, and have special instrumented browsers to load fake system data with one click.
https://reestr.rublacklist.net/en/?page=12&q=archive.org
You can see the fine collection of everything from The Anarchist Cookbook to le ironic nasheed remixes, and from exposures of Astral Jews to Alex Jones there:
That suggests the deeper issue might be language fluency, English can be difficult to parse.
Link Rot (link death, Link Breaking, or Reference Rot) : https://en.wikipedia.org/wiki/Link_rot
I had to switch back to my ISP DNS to have connection successful.
I did not realize that choice of DNS resolver could effect access to a website like this. I thought DNS was boring stable technology. The error conditions weren't even DNS failure (which I would also find surprising from Google or Cloudflare), but that server timeout, or weirder infinite captcha loop.