Simple Machines Community Forum

SMF Support => SMF 1.1.x Support => Aiheen aloitti: galapagos - kesäkuu 12, 2011, 12:31:18 IP

Otsikko: How to Block SEARCH BOTS to my forum?
Kirjoitti: galapagos - kesäkuu 12, 2011, 12:31:18 IP
HI guys, I don't want search bots browsing my forum (google, yahoo!, msn etc) how do I block them?
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: Illori - kesäkuu 12, 2011, 12:34:07 IP
do some googling for robots.txt only good bots will follow that file though.
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: kat - kesäkuu 12, 2011, 12:37:44 IP
You might want to check some of these out, too.

http://www.google.co.uk/search?client=opera&rls=en-GB&rls=en-GB&q=htaccess+bots&sourceid=opera&ie=utf-8&oe=utf-8&channel=suggest
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: galapagos - kesäkuu 12, 2011, 12:55:14 IP
thanks guys.. Never done it so please bear with me.. So do I have to create a .txt file with

User-agent: *
Disallow: /


and it'll block all the robots?
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: kat - kesäkuu 12, 2011, 01:00:54 IP
The problem with robots.txt, is that the norty bots ignore it.

Have a look at the sites that come-up, from the list I posted.

.htaccess stops far more bots than robots.txt does.
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: galapagos - kesäkuu 12, 2011, 01:51:03 IP
so would I just copy this code:

RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} ^BlackWidow [OR]
RewriteCond %{HTTP_USER_AGENT} ^Bot\ mailto:[email protected] [OR]
RewriteCond %{HTTP_USER_AGENT} ^ChinaClaw [OR]
RewriteCond %{HTTP_USER_AGENT} ^Custo [OR]
RewriteCond %{HTTP_USER_AGENT} ^DISCo [OR]
RewriteCond %{HTTP_USER_AGENT} ^Download\ Demon [OR]
RewriteCond %{HTTP_USER_AGENT} ^eCatch [OR]
RewriteCond %{HTTP_USER_AGENT} ^EirGrabber [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailSiphon [OR]
RewriteCond %{HTTP_USER_AGENT} ^EmailWolf [OR]
RewriteCond %{HTTP_USER_AGENT} ^Express\ WebPictures [OR]
RewriteCond %{HTTP_USER_AGENT} ^ExtractorPro [OR]
RewriteCond %{HTTP_USER_AGENT} ^EyeNetIE [OR]
RewriteCond %{HTTP_USER_AGENT} ^FlashGet [OR]
RewriteCond %{HTTP_USER_AGENT} ^GetRight [OR]
RewriteCond %{HTTP_USER_AGENT} ^GetWeb! [OR]
RewriteCond %{HTTP_USER_AGENT} ^Go!Zilla [OR]
RewriteCond %{HTTP_USER_AGENT} ^Go-Ahead-Got-It [OR]
RewriteCond %{HTTP_USER_AGENT} ^GrabNet [OR]
RewriteCond %{HTTP_USER_AGENT} ^Grafula [OR]
RewriteCond %{HTTP_USER_AGENT} ^HMView [OR]
RewriteCond %{HTTP_USER_AGENT} HTTrack [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Stripper [OR]
RewriteCond %{HTTP_USER_AGENT} ^Image\ Sucker [OR]
RewriteCond %{HTTP_USER_AGENT} Indy\ Library [NC,OR]
RewriteCond %{HTTP_USER_AGENT} ^InterGET [OR]
RewriteCond %{HTTP_USER_AGENT} ^Internet\ Ninja [OR]
RewriteCond %{HTTP_USER_AGENT} ^JetCar [OR]
RewriteCond %{HTTP_USER_AGENT} ^JOC\ Web\ Spider [OR]
RewriteCond %{HTTP_USER_AGENT} ^larbin [OR]
RewriteCond %{HTTP_USER_AGENT} ^LeechFTP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mass\ Downloader [OR]
RewriteCond %{HTTP_USER_AGENT} ^MIDown\ tool [OR]
RewriteCond %{HTTP_USER_AGENT} ^Mister\ PiX [OR]
RewriteCond %{HTTP_USER_AGENT} ^Navroad [OR]
RewriteCond %{HTTP_USER_AGENT} ^NearSite [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetAnts [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetSpider [OR]
RewriteCond %{HTTP_USER_AGENT} ^Net\ Vampire [OR]
RewriteCond %{HTTP_USER_AGENT} ^NetZIP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Octopus [OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Explorer [OR]
RewriteCond %{HTTP_USER_AGENT} ^Offline\ Navigator [OR]
RewriteCond %{HTTP_USER_AGENT} ^PageGrabber [OR]
RewriteCond %{HTTP_USER_AGENT} ^Papa\ Foto [OR]
RewriteCond %{HTTP_USER_AGENT} ^pavuk [OR]
RewriteCond %{HTTP_USER_AGENT} ^pcBrowser [OR]
RewriteCond %{HTTP_USER_AGENT} ^RealDownload [OR]
RewriteCond %{HTTP_USER_AGENT} ^ReGet [OR]
RewriteCond %{HTTP_USER_AGENT} ^SiteSnagger [OR]
RewriteCond %{HTTP_USER_AGENT} ^SmartDownload [OR]
RewriteCond %{HTTP_USER_AGENT} ^SuperBot [OR]
RewriteCond %{HTTP_USER_AGENT} ^SuperHTTP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Surfbot [OR]
RewriteCond %{HTTP_USER_AGENT} ^tAkeOut [OR]
RewriteCond %{HTTP_USER_AGENT} ^Teleport\ Pro [OR]
RewriteCond %{HTTP_USER_AGENT} ^VoidEYE [OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Image\ Collector [OR]
RewriteCond %{HTTP_USER_AGENT} ^Web\ Sucker [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebAuto [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebCopier [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebFetch [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebGo\ IS [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebLeacher [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebReaper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebSauger [OR]
RewriteCond %{HTTP_USER_AGENT} ^Website\ eXtractor [OR]
RewriteCond %{HTTP_USER_AGENT} ^Website\ Quester [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebStripper [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebWhacker [OR]
RewriteCond %{HTTP_USER_AGENT} ^WebZIP [OR]
RewriteCond %{HTTP_USER_AGENT} ^Wget [OR]
RewriteCond %{HTTP_USER_AGENT} ^Widow [OR]
RewriteCond %{HTTP_USER_AGENT} ^WWWOFFLE [OR]
RewriteCond %{HTTP_USER_AGENT} ^Xaldon\ WebSpider [OR]
RewriteCond %{HTTP_USER_AGENT} ^Zeus
RewriteRule ^.* - [F,L]



into a notepad, then rename to 1.htaccess and put it into my directory?


Lainaus käyttäjältä: K@ - kesäkuu 12, 2011, 01:00:54 IP
The problem with robots.txt, is that the norty bots ignore it.

Have a look at the sites that come-up, from the list I posted.

.htaccess stops far more bots than robots.txt does.
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: kat - kesäkuu 12, 2011, 01:58:38 IP
Not the "1".

Just .htaccess

Not extension.
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: galapagos - kesäkuu 12, 2011, 02:38:30 IP

thanks a lot! I've just compiled it.. but in that list of bots there's no google/yahoo/msn   should I create robots.txt as well?

Lainaus käyttäjältä: K@ - kesäkuu 12, 2011, 01:58:38 IP
Not the "1".

Just .htaccess

Not extension.
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: kat - kesäkuu 13, 2011, 06:10:35 AP
Can't hurt. :)

You can set .htaccess to prevent those, too, though.

I'm not sure exactly how, though, I'm afraid.
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: galapagos - kesäkuu 13, 2011, 11:55:55 AP
thank you so much! :)
Otsikko: Re: How to Block SEARCH BOTS to my forum?
Kirjoitti: kat - kesäkuu 13, 2011, 11:57:43 AP
Pleasure, mate!