
Filthy robots! There you are at the bar, just trying to lock eyes with a potential customer and those robots keep sliding their mecha-tendrils all over your back pages. At least it can feel that way sometimes in the online world. So much attention is paid to getting the attention of those ‘bots and droids that getting the right kind of attention is sometimes overlooked.
There are parts of your site that you might want robots to keep away from for better search optimization and indexing. Typical examples of these naughty bits include:
- Duplicate content from printer friendly pages and dynamically rendered pages.
- Confirmation and thank you pages that have little useful content.
- Pages designed for internal use only (this would often be a test page or even a page with sensitive information that’s hosted on your site but not viewable by the general public.)
- Form pages or login pages.

Thankfully there are a lot of options out there that websites can use to keep those robot’s oily fingers off of their junk. Yesterday Microsoft (full disclosure – a Reprise Media client) Yahoo! and Google all teamed up like some kind of supergroup of search to provide joint documentation on REP – Robot Exclusion Protocol. Here’s what they came up with and what it’s designed to do:
1. Robots.txt Directives
2. HTML
Keep in mind that a link on another site to a page that uses REP can undo all that carefully applied robot repellent and send you back to square one. This is a rare occurrence but it does happen.


Thanks for all the 411 on this. It’s tough when you’re at the bar and those damn robots go on and mess with ya!
Thanks for the 411 on this. It’s tough when you’re at the bar and those damn robots start messin’ with ya!
[...] – SearchViews: Handy Dandy Surefire Robot Repellent Formulas [...]