UnitedForums - UK Web Hosting Forum UnitedHosting Community Hosting Forums
Old 6th October 2009, 07:11 PM   #1 (permalink)
Nicky
Join Date: Aug 2009
Posts: 25
Stop Search Engines indexing URL

Hi everyone,

Does anyone know how to stop the words in a domain name (the URL itself) from being indexed, so that they can't be searched for via Google etc.? I'm familiar with robots.txt, the various meta tags and directory/page locking for protecting pages and directories; I just wondered whether this was possible too.

Why? I have a potential customer who wants her website pages to be indexed, but not the words within her chosen domain name. In other words, she wants to remain vaguely anonymous while using her memorable name as the domain, and does not want people to be able to search on her name and find it in a URL! …aarghh! Why do I always get the easy jobs!

I said, “It can’t be done, so please choose a nondescript domain name that isn’t so obvious.” That was the only answer I could think of, but the question intrigued me nevertheless. Is it possible to prevent the contents of a URL from being searchable without affecting the indexing of the page content itself? Personally, I don’t think it is.

Just curious…

Last edited by Nicky : 7th October 2009 at 04:11 PM.
Old 8th October 2009, 06:39 AM   #2 (permalink)
acutecomputing
Senior Member
 
Join Date: Mar 2003
Location: Syracuse, Utah, USA
Posts: 137
I can think of a couple of ways;

1. Use Robots.txt - the well-behaved robots respect them.
2. Put all content in images - Still takes a lot of processing power to read them.
3. Use .htaccess to reject search engines (a pain, because you have to find reliable ways of identifying them)
4. Buy several more domains and duplicate the content on them. They might all get penalised and dropped from the index!
5. Don't link to the site - ever. Don't set up email on the site domain (as otherwise, if anyone on gmail emails her, the site will be indexed by Google eventually)
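
On point 3, the User-Agent header can be faked by anyone, so one more reliable way to identify a crawler is a reverse DNS lookup on the requesting IP followed by a forward lookup to confirm it. The following is only a minimal Python sketch of that idea: the function names and the domain list are assumptions made for illustration, and the lookups are injectable so the logic can be tried without network access.

```python
import socket

# Assumption: hostnames of Google's crawlers end in one of these domains.
GOOGLE_CRAWLER_DOMAINS = (".googlebot.com", ".google.com")


def hostname_is_google(hostname):
    """Return True if a reverse-DNS hostname sits under a Google crawler domain."""
    return hostname.lower().rstrip(".").endswith(GOOGLE_CRAWLER_DOMAINS)


def is_verified_googlebot(ip,
                          reverse_lookup=socket.gethostbyaddr,
                          forward_lookup=socket.gethostbyname_ex):
    """Reverse-resolve the IP, check the domain, then forward-resolve to confirm."""
    try:
        hostname = reverse_lookup(ip)[0]
        if not hostname_is_google(hostname):
            return False
        # The forward lookup must lead back to the same IP address.
        return ip in forward_lookup(hostname)[2]
    except OSError:
        # Covers socket.herror and socket.gaierror (failed lookups).
        return False
```

A server-side script could call `is_verified_googlebot(client_ip)` before deciding whether to refuse a request; this costs two DNS lookups per check, so the result would normally be cached.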

Hard to understand why she wants it that way, of course...


HTH,

Pete.
__________________
Utah based custom PHP and Drupal Development
www.kpdirection.com
www.kids-faith.com
Old 26th November 2009, 07:46 AM   #3 (permalink)
No Name
Registered User
 
Join Date: Nov 2009
Posts: 5
I vote for .htaccess. I had something like this while testing the site and it worked.

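RewriteEngine On
# Return 403 Forbidden to any request whose User-Agent contains "Googlebot"
# (case-insensitive). One unanchored pattern is enough: Googlebot's User-Agent
# usually starts with "Mozilla/5.0", so ^Googlebot alone would often miss it.
# This only blocks Googlebot; other crawlers need their own conditions.
RewriteCond %{HTTP_USER_AGENT} Googlebot [NC]
RewriteRule .* - [F]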

Never rely on robots.txt alone
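
A quick way to see how anchoring affects a User-Agent pattern: an anchored pattern such as `^Googlebot` only matches when the header starts with that word, while an unanchored `Googlebot` matches anywhere in it. The Python sketch below only approximates what mod_rewrite does with its own regex engine; the helper name `would_block` and the browser string are made up for illustration.

```python
import re

# Typical Googlebot User-Agent string: note that it starts with "Mozilla".
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
# Illustrative browser User-Agent (an assumption, not taken from the thread).
BROWSER_UA = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) Firefox/120.0"

ANCHORED = re.compile(r"^Googlebot", re.IGNORECASE)    # like "^Googlebot [NC]"
UNANCHORED = re.compile(r"Googlebot", re.IGNORECASE)   # like "Googlebot [NC]"


def would_block(pattern, user_agent):
    """Return True if the pattern matches, i.e. the rule would send a 403."""
    return pattern.search(user_agent) is not None
```

With these strings, `would_block(ANCHORED, GOOGLEBOT_UA)` is false while `would_block(UNANCHORED, GOOGLEBOT_UA)` is true, and neither pattern blocks the ordinary browser.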
__________________
Signature removed by moderator.
Old 26th February 2010, 09:39 AM   #4 (permalink)
thinkbizom
Om Prakash
 
Join Date: Dec 2009
Posts: 7
Use Robots.txt

This file can be used to block crawlers from certain pages and folders on your site. Make sure you don't accidentally block them from pages you do want indexed. I personally don't really count the robots.txt file as part of on-page SEO, since it's not directly contributing to where you rank (you don't actually need a robots.txt file at all).

This example allows all robots:
User-agent: *
Allow: /

This example keeps all robots out:
User-agent: *
Disallow: /
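
To check what a well-behaved robot would make of the two examples above, Python's standard `urllib.robotparser` module can parse the rules and answer "may this agent fetch this URL?". This is only a sketch: the helper `is_allowed` and the example.com URL are assumptions for illustration, and real crawlers may interpret edge cases differently.

```python
from urllib.robotparser import RobotFileParser

# The two robots.txt examples from the post, one directive per line.
ALLOW_ALL = ["User-agent: *", "Allow: /"]
BLOCK_ALL = ["User-agent: *", "Disallow: /"]


def is_allowed(robots_lines, user_agent, url):
    """Parse robots.txt lines and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)
```

For instance, `is_allowed(BLOCK_ALL, "Googlebot", "http://example.com/page.html")` returns `False`, while the same call with `ALLOW_ALL` returns `True`.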


.htaccess code

RewriteEngine On
# Return 403 Forbidden when the User-Agent contains "Googlebot" (case-insensitive).
# This only blocks Googlebot; other crawlers need their own conditions.
RewriteCond %{HTTP_USER_AGENT} Googlebot [NC]
RewriteRule .* - [F]
Old 2nd April 2010, 09:24 PM   #5 (permalink)
DavidAmerland
Registered User
 
Join Date: Apr 2010
Location: Manchester
Posts: 3
Intriguing

Nicky, you're right. This is idiosyncratic enough to border on the unique. You can block off entire parts of a site with a 'Disallow' rule in your robots.txt file (or a 'noindex' robots meta tag on individual pages); Googlebot will respect it, and so will Bing and Yahoo! The rest are of no real importance. That protects the contents of specific pages within a domain name (or entire sections of it), but not the domain name itself. Alternatively, use a doorway page to present different content to search engine bots and human visitors: not only do you guarantee it will not be searched, but if you do any SEO on it Google will drop it from the index altogether.

I hope this helps.

David
All times are GMT.
Copyright © 1998-2009 United Communications Limited. All Rights Reserved. Registered in England and Wales 3651923 - VAT Reg No. 737662309