betzster's Blog
A Contact Page to Avoid Spam

Too many website owners display their email address in the “name [at] tld [dot] com” format, or something like it. Whatever happened to the good old mailto link, with the address displayed the way it’s supposed to be? I’ll tell you what happened: spam happened.

Many site owners are afraid to display their email address on their site the way it should be displayed, since spammers can use scripts to scan websites for email addresses, the same way Google crawls them. I’ve been seeing more and more contact forms and weirdly formatted addresses used to combat the issue, but there’s a better way.

The robots.txt file in the root of your website is where you tell these bots what they can and can’t do. So if you’ve got your contact info on a certain page, don’t allow bots to view that page. Bingo.
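Here is a minimal sketch of what that could look like, using Python's standard urllib.robotparser to check how a rule-following bot would read the file. The path /contact.html and the example.com URLs are just assumptions for illustration; use whatever your contact page is actually called.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content. "/contact.html" is an assumed path,
# not something taken from a real site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /contact.html
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a well-behaved bot may fetch `url` under these rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The contact page is off limits, the rest of the site is not.
    print(is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/contact.html"))
    print(is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/index.html"))
```

This only shows what a bot that honors robots.txt would do. It says nothing about bots that ignore the file.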

Now, robots.txt is a voluntary convention and nothing guarantees that spam bots will follow the rules, but we can take this a step further.

As described over on the SpamCop forums, you can create a bot trap. Basically, you create a script that redirects to a 403 page. On the redirect page you can output anything: random email addresses, addresses that you want to monitor for spam, etc. Then you make sure legit bots that follow the rules of your robots.txt file don’t get caught in the trap by putting a “Disallow: <your bot trap>” line in the robots.txt file. To make sure the bad bots do get caught, you put an anchor link to the trap inside an HTML comment as the first and last lines of each page. When the bad bots see the link they will follow it, and then they get the 403 error.
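To make the moving parts concrete, here is a rough Python sketch of the trap. It is not the script from the SpamCop forums. The trap path /bot-trap/, the function names, and the example.com decoy domain are all made up for illustration, and for simplicity the handler answers with the 403 directly instead of redirecting to a separate 403 page.

```python
import secrets
from typing import List, Tuple

# Hypothetical trap location. It has to match the Disallow line in robots.txt.
TRAP_PATH = "/bot-trap/"
ROBOTS_RULE = f"Disallow: {TRAP_PATH}"


def hidden_trap_link(trap_path: str = TRAP_PATH) -> str:
    """Anchor link wrapped in an HTML comment, so human visitors never see it."""
    return f'<!-- <a href="{trap_path}">contact</a> -->'


def wrap_page(html: str) -> str:
    """Put the hidden link on the first and last lines of a page."""
    link = hidden_trap_link()
    return f"{link}\n{html}\n{link}"


def decoy_addresses(count: int, domain: str = "example.com") -> List[str]:
    """Random throwaway addresses to feed to a harvester (assumed domain)."""
    return [f"{secrets.token_hex(4)}@{domain}" for _ in range(count)]


def handle_request(path: str) -> Tuple[int, str]:
    """Return (status, body): a 403 with decoys for the trap, 200 otherwise."""
    if path.startswith(TRAP_PATH):
        return 403, "\n".join(decoy_addresses(5))
    return 200, "ok"


if __name__ == "__main__":
    print(ROBOTS_RULE)
    print(wrap_page("<html><body>Hello</body></html>"))
    print(handle_request(TRAP_PATH)[0])
```

In a real setup handle_request would live in whatever server-side script answers requests for the trap URL, and you could swap the random decoys for addresses you actually want to monitor for spam.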

This isn’t bulletproof, but I think it helps just to disallow your contact page from being crawled by legitimate bots like Google. I don’t think you want your contact page showing up in Google’s cached pages or search results anyway.

Resources: The Web Robots Pages (How to use robots.txt)


January 31, 2010 | 12:01 PM
