Computer Geeks

  #1  
04-21-2006, 04:24 AM
yetanotherfcw
Member
GB Advanced User
Join Date: Mar 2006
Posts: 60
Get robots.txt to work

So, I used the following robots.txt, but it does not seem to work.
Quote:
Disallow: /post-*.html$
Disallow: /updates-topic.html*$
Disallow: /stop-updates-topic.html*$
Disallow: /ptopic*.html$
Disallow: /ntopic*.html$
I really want this to work, so I am thinking about making some changes like the following:
Quote:
Disallow: /post-*.html
Disallow: /updates-topic.html*
Disallow: /stop-updates-topic.html*
Disallow: /ptopic*.html
Disallow: /ntopic*.html
Quote:
Disallow: /post-*.html
Disallow: /updates-topic.html
Disallow: /stop-updates-topic.html
Disallow: /ptopic*.html
Disallow: /ntopic*.html
How can I get this robots.txt to work? What (minor) modifications do I need to make?
  #2  
04-21-2006, 05:02 AM
Soulwatcher
Senior Member
GB GEEK
Join Date: Feb 2006
Posts: 309

I don't know much about robots.txt, but Nikolas over at http://www.webdigity.com/ could probably answer all of your questions.


Soulwatcher
  #3  
04-21-2006, 05:08 AM
southernlady
Junior Member
GB Beginner
Join Date: Apr 2006
Posts: 20

yetanotherfcw, why do you want to turn off your robot search? That's where your traffic will come from eventually. If you don't allow the Google, MSN, and Yahoo search bots to crawl your site, you limit how much of it shows up in their listings. Liz
  #4  
04-21-2006, 06:53 AM
Coop
Member
GB Beginner
Join Date: Apr 2006
Posts: 34

I'm not an expert on robots.txt either, but I was under the impression that you need a User-agent line for it to work, so try this:

Quote:
User-agent: *
Disallow: /post-*.html
Disallow: /updates-topic.html*
Disallow: /stop-updates-topic.html*
Disallow: /ptopic*.html
Disallow: /ntopic*.html
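
The User-agent point can be checked locally. The sketch below feeds two rule sets to Python's standard-library `urllib.robotparser`; the crawler name `ExampleBot` and the `example.com` URL are placeholders that do not come from this thread, and the rule is a plain path prefix because wildcard handling differs between parsers.

```python
from urllib.robotparser import RobotFileParser


def blocked(robots_lines, url, agent="ExampleBot"):
    """Return True if the given robots.txt lines forbid `agent` from fetching `url`."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return not parser.can_fetch(agent, url)


URL = "http://example.com/updates-topic.html"  # placeholder URL, not from the thread

# A rule with no User-agent line above it: there is no record to attach it to.
without_agent = ["Disallow: /updates-topic.html"]

# The same rule inside a record that applies to every crawler.
with_agent = ["User-agent: *", "Disallow: /updates-topic.html"]

print("without User-agent:", blocked(without_agent, URL))
print("with User-agent:   ", blocked(with_agent, URL))
```

With this parser, only the second rule set blocks the URL. Other crawlers may be more or less forgiving, so treat this as one data point rather than a guarantee.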
Also, not all robots understand the wildcards yet, and some completely ignore robots.txt, so if you are having trouble with a particular crawler, you may need to add different sections for each one, or even ban their IP blocks (usually required for email harvesters, etc.).
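
To see what the patterns in this thread would match for a crawler that does honor wildcards, here is a minimal sketch. It assumes the common extension where `*` matches any run of characters and a trailing `$` anchors the end of the URL; the sample paths and the helper names `pattern_to_regex` and `is_disallowed` are made up for illustration, and real crawlers may differ in details such as percent-encoding.

```python
import re


def pattern_to_regex(pattern):
    """Compile a robots.txt path pattern, assuming '*' and a trailing '$' are supported."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and let each '*' match any run of characters.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path, patterns):
    """Return True if any pattern matches from the start of `path`."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


WITH_ANCHOR = ["/post-*.html$", "/ptopic*.html$"]
WITHOUT_ANCHOR = ["/post-*.html", "/ptopic*.html"]

# Hypothetical paths: a plain post page, the same page with a query string, an unrelated page.
for path in ["/post-123.html", "/post-123.html?highlight=x", "/index.html"]:
    print(path, is_disallowed(path, WITH_ANCHOR), is_disallowed(path, WITHOUT_ANCHOR))
```

Under these assumptions the `$` versions stop matching as soon as a query string follows `.html`, while the versions without `$` still match, which is one reason to prefer the unanchored form when the goal is to block every variant of a page.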

Coop
  #5  
04-21-2006, 07:22 AM
Ashley
Junior Member
GB Newbie
Join Date: Mar 2006
Posts: 12

The best thing to do would be to use a robots.txt generator. Try Googling; there are several. This will make sure that bots can understand it.

Secondly, not every bot follows robots.txt, so if you want to block them you will want to use .htaccess.

Make sure that it is in the root of your site [e.g. /public_html] and chmodded so that anyone can read it.
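
A quick way to check the permission part on a Unix-like host is sketched below. The path `public_html/robots.txt` is only an assumed example location, and the check looks at the file's own mode bits, not at the directories above it or at the web server configuration.

```python
import os
import stat


def is_world_readable(path):
    """Return True if the file's mode grants read permission to 'other' users."""
    return bool(os.stat(path).st_mode & stat.S_IROTH)


if __name__ == "__main__":
    # Hypothetical location; adjust to the real document root.
    candidate = "public_html/robots.txt"
    if os.path.exists(candidate):
        print(candidate, "world-readable:", is_world_readable(candidate))
    else:
        print(candidate, "not found")
```

A mode such as 644 passes this check, while 600 does not.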
  #6  
04-21-2006, 09:55 AM
LucnetSolutions
Member
GB Beginner
Join Date: Mar 2006
Posts: 49

Go to Google and sign up for their validation. They have step-by-step instructions to help you. They helped me set mine up.
  #7  
04-21-2006, 07:44 PM
yetanotherfcw
Member
GB Advanced User
Join Date: Mar 2006
Posts: 60

Quote:
Originally Posted by LucnetSolutions
Go to Google and sign up for their validation. They have step-by-step instructions to help you. They helped me set mine up.
Is there a link for this?