robots.txt Adventure. Interesting notes from crawling 4.6 million robots.txt, including 69 different ways in which the word “disallow” can be mis-spelled.
robots.txt Adventure. Interesting notes from crawling 4.6 million robots.txt, including 69 different ways in which the word “disallow” can be mis-spelled.
Funny,
I just yesterday pieced together a little Django app which handles robots.txt requests, manageable with the admin interface (currently only with the oldforms-admin/trunk). Thanks for the great link!
http://code.google.com/p/django-robots/