Link Exchange Forum
Search  Search   Memberlist  Certified Members  
Register  ::  Log in Log in to check your private messages


Post new topic  This topic is locked: you cannot edit posts or make replies.
 robots.txt « View previous topic :: View next topic » 
Author Message
Sebastian
PostPosted: Tue Sep 18, 2007 4:07 am    Post subject: robots.txt Reply with quote

Admin

Joined: 08 Apr 2007
Posts: 51

Learn how to use robots.txt to control how search engines and crawlers access your site.


# disallow all crawlers
User-agent: *


# disallow a specific crawler
User-agent: GoogleBot


# disallow all files
Disallow: /


# disallow specific urls
Disallow: /login.php
Disallow: /support/faq.html
Disallow: /contact.asp


# disallow all files with a ? in url
Disallow: /*?*


# disallow all files ending with these extensions
Disallow: /*.php$
Disallow: /*.js$
Disallow: /*.inc$
Disallow: /*.css$
Disallow: /*.txt$


# disallow all files in a specific directory
Disallow: /admin/
Disallow: /comments/


# disallow archiving site
User-agent: ia_archiver
Disallow: /


# allow google image bot to search all images
User-agent: Googlebot-Image
Allow: /*

Note: The robots.txt file must be uploaded to the root accessible directory of your site (ie: mysite.com/robots.txt)
_________________
Please do not email or PM me with support questions. Please post them on the forum instead.
Back to top
Display posts from previous:   
Post new topic  This topic is locked: you cannot edit posts or make replies. Page 1 of 1

Jump to:  

 
SEO Friendly Web DirectorySEO Friendly Solid Crawler Web Directory Solid Crawler Paid Directory Paid Directory Directory of Directories Directory of Directories


DAJ Glass (1.0.8) template by Dustin Baccetti
Powered by phpBB © 2001, 2005 phpBB Group