The bots have begun to access my website way more often. I’m getting about 120k hits on https://www.uninformativ.de/git/ now in a couple of hours.
They don’t cache anything, probably on purpose.
It comes in waves. I get about 100 hits (all at once) on that /git endpoint, all from different IPs. Then it takes a moment until I get another wave of about 500-1000 requests (all at once) where they do HEAD requests on some of the paths below /git. I assume they did a GET earlier and are now checking if something has changed.
#sux32qq
(#sux32qq) It doesn’t pose a problem for my server’s performance yet. But if more bots/companies start doing this, my website will go down from the load.
#evgtvea
(#sux32qq) This probably means that I can no longer host my own website. I don’t want to deploy something like Anubis, because that ruins the whole thing: I want it to be accessible from ancient browsers, like the ones on OS/2 or Windows 3.11.
I’ll keep an eye on it for a while. Maybe try to block some IPs.
Sooner or later, I’ll take the website down and shift everything to Gopher.
#qiq5bnq
(#sux32qq) Why do I care about this?
- The load will become a problem at some point.
- These crawlers and the current “AI” in general are breaking the rules. I am supposed to be paying for every little thing, I get sued for “piracy”. But apparently, these rules only apply to me. If I had more money, I could break them. Fuck that.
- I simply don’t want it. Period.
#2s2wjga
(#sux32qq) “But all your stuff is MIT licensed! They are allowed to do that!”
Haha. As if they would care. They crawl everything they get their hands on.
Besides, that’s not true, the license states that the copyright notice must be retained. “AI” breaks that. They incorporate my code and my articles in their product and make it appear as if it was their work.
#imftnja
(#sux32qq) @movq@www.uninformativ.de Right now I’m basically just blocking entire ASNs and large blocks of IPs from Anthropic, OpenAI, Microsoft and others.
#nbo4v7q
(#sux32qq) @prologic@twtxt.net Yeah, I’ve blocked some large subnets now (most likely overblocking a lot of stuff) and it has died down.
I’m not looking forward to doing this on a regular basis. This is supposed to be a fun hobby, and it was, for many years. Maybe that time is just over.
#kd7r2tq
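For anyone wanting to try the subnet-blocking approach mentioned above, here is a minimal sketch of how offending client IPs could be grouped into larger networks before they go into a firewall. It is not what anyone in this thread actually runs. The function names, the /24 prefix length and the hit threshold are all assumptions for illustration, and it only handles IPv4. As noted above, blocking whole networks will most likely overblock.

```python
import ipaddress
from collections import Counter


def busiest_networks(ips, prefix_len=24, min_hits=3):
    """Group IPv4 addresses into /prefix_len networks.

    Returns the networks that were seen at least min_hits times,
    sorted by network address. prefix_len and min_hits are
    arbitrary example values, not recommendations.
    """
    counts = Counter(
        # strict=False lets us pass a host address and get its network
        ipaddress.ip_network(f"{ip}/{prefix_len}", strict=False)
        for ip in ips
    )
    return sorted(
        (net for net, hits in counts.items() if hits >= min_hits),
        key=lambda net: net.network_address,
    )


def merged_blocklist(networks):
    """Merge adjacent or overlapping networks into fewer, larger ones."""
    return list(ipaddress.collapse_addresses(networks))


if __name__ == "__main__":
    # Made-up client addresses, e.g. extracted from an access log.
    sample = [
        "10.0.0.1", "10.0.0.2", "10.0.0.3",
        "10.0.1.1", "10.0.1.2", "10.0.1.3",
        "192.0.2.7",
    ]
    for net in merged_blocklist(busiest_networks(sample)):
        print(net)
```

The two adjacent /24 networks in the sample collapse into a single /23, while the lone address stays below the threshold and is not blocked.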
(#sux32qq) As expected: Didn’t last long. They’re coming from different IPs now.
I’ve read enough blog posts by other people to know that this is probably pointless. The bots have so many IPs/networks at their disposal …
#5qaxnia
(#sux32qq) @dce@hashnix.club Yeah, I’ve read about that approach. Sounds clever. Truth is, I’m too tired. 😢 I don’t want to spend too much of my time fighting assholes.
I’ve now started blocking entire cloud hosters. Sorry, not sorry.
#znee76q