zlacker

[parent] [thread] 6 comments
1. cpncru+(OP)[view] [source] 2025-12-06 00:27:22
I have been managing production commercial web servers for 28 years.

Yes, there are various bots, and some of the large US companies such as Perplexity do indeed seem to be ignoring robots.txt.

Is that a problem? It's certainly not a problem with cpu or network bandwidth (it's very minimal). Yes, it may be an issue if you are concerned with scraping (which I'm not).

Cloudflare's "solution" is a much bigger problem that affects me multiple times daily (as a user of sites that use it), and those sites don't seem to need protection against scraping.

replies(2): >>filled+v2 >>kviran+R3
2. filled+v2[view] [source] 2025-12-06 00:48:50
>>cpncru+(OP)
It is rather disingenuous to backpedal from "you can easily block them" to "is that a problem? who even cares" when someone points out that you cannot in fact easily block them.
replies(1): >>cpncru+h4
3. kviran+R3[view] [source] 2025-12-06 01:03:38
>>cpncru+(OP)
Security almost always brings inconvenience (to everyone involved, including end users). That is part of its cost.
replies(1): >>cpncru+16
◧◩
4. cpncru+h4[view] [source] [discussion] 2025-12-06 01:07:59
>>filled+v2
I was referring to legitimate ones, which you can easily block. Obviously there are scammy ones as well, and yes it is an issue, but for most sites I would say the cloudflare cure is worse than the problem it's trying to cure.
replies(1): >>oasisb+eo1
◧◩
5. cpncru+16[view] [source] [discussion] 2025-12-06 01:22:25
>>kviran+R3
What security issue is actually being solved here though?
◧◩◪
6. oasisb+eo1[view] [source] [discussion] 2025-12-06 16:34:31
>>cpncru+h4
No true scotsman needs Cloudflare, as any true scotsman can block AI bots themselves is not a strong argument.
replies(1): >>cpncru+pi2
◧◩◪◨
7. cpncru+pi2[view] [source] [discussion] 2025-12-07 00:30:45
>>oasisb+eo1
But is there any actual evidence that any major AI bots are bypassing robots.txt? It looked as if Perplexity was doing this, but after looking into it further it seems that likely isn't the case. Quite often people believe single source news stories without doing any due diligence or fact checking.
[go to top]