zlacker

[parent] [thread] 1 comments
1. annoyi+(OP)[view] [source] 2023-05-10 14:00:03
Do we need more features that are generally ignored? What has robots.txt gotten us? What has Do Not Track gotten us?
replies(1): >>shadow+hj
2. shadow+hj[view] [source] 2023-05-10 15:20:48
>>annoyi+(OP)
> What has robots.txt gotten us

A standard protocol for reputable crawlers to semantically understand some high-level page navigation rules.

Actual, useful crawling (i.e. to build search indices) would be much messier and more useless without most interesting sites putting up meaningful robots.txt guide-rails. Look at facebook.com/robots.txt and consider how much crap both Facebook and indexers would have to deal with lacking that information.

[go to top]