# Sample robots.txt file (make sure the filename is ALL LOWERCASE on Linux/Unix systems)
# This file should go in your web site's ROOT directory
# The root directory is where your site's main /index.html file would be found
# It is usually found in /yourhomedir/public_html/ or /yourhomedir/httpdocs
# Where "yourhomedir" is your user account's name

# This says to apply these settings to ALL search engine spiders/crawlers
User-agent: *
# These settings will keep spiders from indexing your unwanted PAGES
# This assumes that your OSC install is in your web site's ROOT directory
# i.e.: http://www.yoursite.com/index.php <- Use if this brings up your OSC main page
Disallow: /account.php
Disallow: /advanced_search.php
Disallow: /checkout_shipping.php
Disallow: /create_account.php
Disallow: /login.php
Disallow: /password_forgotten.php
Disallow: /popup_image.php
Disallow: /shopping_cart.php
Disallow: /ssl_check.php
Disallow: /account_edit.php
Disallow: /account_history.php
Disallow: /account_history_info.php
Disallow: /account_newsletters.php
Disallow: /account_notifications.php
Disallow: /account_password.php
Disallow: /address_book.php
Disallow: /address_book_process.php
Disallow: /info_pages.php
Disallow: /info_shopping_cart.php
Disallow: /ipn.php
Disallow: /logoff.php
Disallow: /popup_paypal.php
Disallow: /product_reviews_info.php
Disallow: /product_reviews_write.php
Disallow: /allprods_with_model.php
# These settings will keep spiders from indexing your unwanted FOLDERS
# This assumes that your OSC install is in your web site's ROOT directory
# i.e.: http://www.yoursite.com/index.php <- Use if this brings up your OSC main page
Disallow: /webalizer
Disallow: /video
Disallow: /usage
Disallow: /Templates
Disallow: /pub
Disallow: /other_files
Disallow: /newsletter
Disallow: /forum_pics
Disallow: /formmail
Disallow: /extras
Disallow: /ebay
Disallow: /calendar
Disallow: /admin
Disallow: /includes
Disallow: /_vti_txt
Disallow: /_vti_pvt
Disallow: /_vti_log
Disallow: /_vti_cnf
Disallow: /_vti_bin
Disallow: /private
Disallow: /_notes
Disallow: /cgi-bin/
# Feel free to add any other pages on your site that you don't want to be indexed by
# the search engines.
# PLEASE NOTE: Any pages that you list here should be secured by other means if you
# don't want people to be able to view them, as some malicious users will look at a
# robots.txt file to try to find "hidden" or "secret" areas of web sites to find
# confidential information.
# Just uncomment the line below or add new ones as you see fit.
# Disallow: /hidden

# IF YOU DO NOT WISH TO HAVE THE GOOGLE IMAGE BOT SCAN YOUR DOMAIN FOR IMAGES
# THEN YOU CAN INCLUDE THE FOLLOWING IN YOUR ROBOTS FILE.
# I FOUND THAT MY BANDWIDTH USAGE DROPPED BY A MASSIVE AMOUNT AFTER I GOT RID
# OF THE GOOGLE IMAGE BOT. ALL I HAD WAS IMAGE HUNTERS STEALING PRODUCT SHOTS
# AND NOT EVEN BROWSING THE SITE.
# Spaces in folder names are written as %20 so that the paths match the real URLs.
User-agent: Googlebot-Image
Disallow: /gallery/Ace%20Hobby%20Niagara%20Falls%20NY
Disallow: /ebay
Disallow: /forum_pics
Disallow: /Doug%20Hill%20Montreal
Disallow: /Larry%20Anderson
Disallow: /res
Disallow: /slides
Disallow: /Slot%20Car%20Circuit
Disallow: /thumbs
Disallow: /Whitby%20Raceway
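
# A possible alternative, offered as a sketch rather than a tested setup: if the
# goal is to keep the Google image bot away from the WHOLE domain instead of only
# the folders listed above, a single rule should be enough. To try it, remove the
# Googlebot-Image section above and uncomment the two lines below.
# User-agent: Googlebot-Image
# Disallow: /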