The Open-Source Software Saving the Internet From AI Bot Scrapers
The Open-Source Software Saving the Internet From AI Bot Scrapers

Anubis, which block AI scrapers from scraping websites to death, has been downloaded almost 200,000 times.

The Open-Source Software Saving the Internet From AI Bot Scrapers
Anubis, which block AI scrapers from scraping websites to death, has been downloaded almost 200,000 times.
You're viewing a single thread.
Would you edit your post and add the following archive link to the body, please?
Unfortunately, archiveis seems to have moved behind a big corporate CAPTCHA service, subjecting readers to having their reading habits (both the articles and the referring communities) tracked at a large scale.
I suggest this archive link instead:
Unfortunately, archive.is has moved behind Cloudflare, subjecting readers to having their reading habits (both the articles and the referring communities) tracked at a large scale.
How do you know this?
What about https://ghostarchive.org/?
Sorry; I shouldn't have written Cloudflare specifically. Their CAPTCHA page now contains scripts from Google, not Cloudflare. I have corrected my comment.
How do you know this?
Because a couple months ago, archiveis/archivetoday started showing me CAPTCHA pages instead of the archived articles when I use Firefox with scripts disabled. The current page contains scripts hosted by Google, which I won't enable, so I can't read the archived articles.
What about https://ghostarchive.org/?
I haven't used that site enough to have a consistent picture of what it's doing. When I tried it a few minutes ago, it directed me to a CAPTCHA wall when trying to submit an article, but not when searching for an archived article. I'll try to remember to look at it again periodically, to be able to answer this question in the future.
Thanks. I appreciate the info and effort.
To be honest with you, I refuse on moral grounds. 404 are independent and do good work. You've already linked a pay wall bypass in the comments, if anyone would like to find it, it's not hard to scroll.
OK. Fair enough.