Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations Mike Lewis on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Removing our xml sitemap from google results

Status
Not open for further replies.

ben1234zz

MIS
May 12, 2008
71
GB
Hi

We added a link from our HTML sitemap to our XML sitemap on google and are now finding that the XML sitemap is returned in google search results (not good as it just looks like code).

How can I remove the sitemap from google results, but ensure that google still crawls though it?

Thanks
B
 
1. Create a robots.txt file that tells Google to ignore the sitemap file.

2. Use the Google webmaster tools to request removal from its index.

--
Tek-Tips Forums is Member Supported. Click Here to donate

<honk>*:O)</honk>

Tyres: Mine's a pint of the black stuff.
Mike: You can't drink a pint of Bovril.


 
Hi

Thanks for your post, will this mean that the sitemap itself will not be searched by google or will it follow the links in the map but not show it in the results?

Regards

B
 
Hmm, I'm not sure that if you create a robots file blocking the xml sitemap whether Google Sitemaps will then grab and crawl it. Logic tells me it won't.

However if you read up on the robots protocol you can ask the visiting agent to crawl but not index the sitemap file (I think).

There is a handy tool in Google Webmasters for testing robots files.

You also may just be able to ask Google to drop the file from it's index using the Webmaster tools without making the robots file. Though I believe you are supposed to generate a 404 or block access to the file.

--
Tek-Tips Forums is Member Supported. Click Here to donate

<honk>*:O)</honk>

Tyres: Mine's a pint of the black stuff.
Mike: You can't drink a pint of Bovril.


 
great question, I can't for the life of me think of an answer =). It's a common situation though. I'd try adding the sitemap=blahblah line into robots.txt followed by noinde instruction for the file on the next line. Failing that probably some .htaccess trickery would do it
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top