zlacker

[parent] [thread] 0 comments
1. shadow+(OP)[view] [source] 2023-05-10 15:20:48
> What has robots.txt gotten us

A standard protocol for reputable crawlers to semantically understand some high-level page navigation rules.

Actual, useful crawling (i.e. to build search indices) would be much messier and more useless without most interesting sites putting up meaningful robots.txt guide-rails. Look at facebook.com/robots.txt and consider how much crap both Facebook and indexers would have to deal with lacking that information.

[go to top]