############################### # # User-agent: * # # list folders robots are not allowed to index # Disallow: /BannerManager/ Disallow: /Admin/ Disallow: /ProdImages/ Disallow: /Images/ Disallow: /frank/ #Internet Archive Wayback Machine: Takes a periodic snapshot of your site, making it available for browse/search years after pages may have been taken down. To block it, put these lines in your robots.txt file: User-agent: ia_archiver Disallow: / #Google Images, Yahoo Image Search, PicSearch: These crawlers look for images on your site, make a best-guess as to their content, and make it easy for everyone to view or download. Depending on whether you think this is good or bad, you may want to block them. Add these lines to your robots.txt file: User-agent: Googlebot-Image Disallow: / User-agent: Yahoo-MMCrawler Disallow: / User-agent: psbot Disallow: / # # list specific files robots are not allowed to index # #Disallow: /directory/foo.html #Disallow: /folder2/whatever.html # # ###############################