zlacker

[return to "Google to explore alternatives to robots.txt"]
1. blackl+X9 2023-07-08 07:36:27
>>skille+(OP)
Why are those folks trying to sprinkle AI over everything, even when it's completely inappropriate?

There's no AI involved in web crawling. If you come to my site, I'll tell you which pages you can visit/index and which pages you can't, end of story.

Yes, there are security concerns with people putting /very-secret-admin-panel in their robots.txt and letting malicious actors know which URLs to target. But if /very-secret-admin-panel is never linked from any public page, the bot won't encounter it, so this stuff never belongs in robots.txt in the first place.
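
To make both points concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the `ExampleBot` user agent, the `example.com` host and the `allowed` helper are all made up for illustration. It shows a well-behaved crawler asking "may I fetch this?", and that anyone can read the Disallow lines just as easily as the crawler can.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, inlined so the sketch runs without network access.
ROBOTS_TXT = """\
User-agent: *
Disallow: /very-secret-admin-panel
Disallow: /drafts/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` on the (made-up) example.com."""
    return parser.can_fetch(agent, "https://example.com" + path)


# What a polite crawler does: check before fetching.
for path in ("/index.html", "/drafts/post-1", "/very-secret-admin-panel"):
    print(path, "->", "fetch" if allowed(path) else "skip")

# What anyone else can do: read the file and list the "hidden" paths.
disclosed = [
    line.split(":", 1)[1].strip()
    for line in ROBOTS_TXT.splitlines()
    if line.lower().startswith("disallow:")
]
print("paths disclosed by robots.txt:", disclosed)
```

The rules are advisory: nothing here stops a client that simply ignores robots.txt, which is why unlinked admin URLs are better left out of the file.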

Please keep it as straightforward as this and don't add any AI bullshit to one of the few remaining simple processes in web development and administration.

2. iamphi+Ca 2023-07-08 07:45:10
>>blackl+X9
Perhaps they intend to provide a way to say whether or not your content can be used to train an AI model.