With Confluence Server, Google indexes some bad URLs which shouldn't be visibile to users (like https://myconfluencesite/plugins/recently-updated/).
How can I avoid that?
Is the only solution to try and use a robots.txt and disallow /plugins/?
If so, how do you add a robots.txt to a Confluence Server install?
Any help would be appreciated, thanks!
thanks for your reply and the link. It says that if I can't upload a robots.txt file (which is the case for my setup, unfortunately), then I should use the meta tag. But as I understand it, the meta tag doesn't let me block a URL like https://myconfluencesite/plugins/recently-updated/ from getting indexed, right? Or how would a meta tag look like that does that?
My understanding is that you can block your entire instance from Google's indexing: Block search indexing with 'noindex'. Here are the examples of what the meta tag would look like, which you would paste in under Confluence Admin>Custom HTML>At the end of the head:
To prevent most search engine web crawlers from indexing a page on your site, place the following meta tag into the
<head>section of your page:
<meta name="robots" content="noindex">
To prevent only Google web crawlers from indexing a page:
<meta name="googlebot" content="noindex">
You should be aware that some search engine web crawlers might interpret the
noindexdirective differently. As a result, it is possible that your page might still appear in results from other search engines.
Hi Community! I’m Elaine, Confluence Product Manager. You may have read my earlier post about page tree in space navigation sidebar. I'm excited to share another improvement that helps you organize ...
Connect with like-minded Atlassian users at free events near you!Find a group
Connect with like-minded Atlassian users at free events near you!
Unfortunately there are no AUG chapters near you at the moment.Start an AUG
You're one step closer to meeting fellow Atlassian users at your local meet up. Learn more about AUGs