How to make a robots.txt that disallows 'label' paths

Cassandra Targett September 24, 2014

I have a robots.txt on my Confluence site and would like to prevent crawlers from crawling the label pages. I've got the following Disallow lines in my robots.txt, but they don't appear to be working:

Disallow: /label/
Disallow: /labels/
Disallow: /*label*

I borrowed the last line from https://github.com/childnode/atlassian-confluence-config/blob/master/robots.confluence.txt. I thought Disallow didn't support regex, but I threw it in there anyway.
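
As a side note, the asterisks there aren't regex at all: path wildcards are a nonstandard extension that Googlebot and some other major crawlers honor, while a strictly standard parser treats /*label* as a literal path prefix and effectively matches nothing. For what it's worth, a minimal complete file using only standard prefix matching, assuming the rules should apply to every crawler, would look like this (Disallow lines only take effect inside a User-agent group, so a missing User-agent line is one common reason rules get silently ignored):

User-agent: *
Disallow: /label/
Disallow: /labels/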

Does anyone know a better path to disallow that would prevent the label pages from being crawled?

(By the way, I know it's not working because I have a crawler that is crawling the site. It's possible there is a bug in the crawler, but it is respecting all my other rules, so it seems unlikely that it would arbitrarily ignore a valid rule.)

Thanks in advance for any help.

1 answer

0 votes
Tiago Comasseto
Rising Star
September 24, 2014

The robots.txt looks correct to me. I don't have a way to test it at the moment, but just wondering: have you restarted the application so that it picks up the new robots.txt? Cheers
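
If you want to sanity-check the rules without waiting on a live crawler, here's a rough sketch using Python's standard library. Note that urllib.robotparser does plain prefix matching only, so it evaluates the /label/ and /labels/ lines but would treat the wildcard line as a literal path; the URLs below are hypothetical stand-ins for real Confluence label pages:

from urllib.robotparser import RobotFileParser

# The rules under test; Disallow lines apply within a User-agent group.
rules = """\
User-agent: *
Disallow: /label/
Disallow: /labels/
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# Hypothetical URLs; substitute your Confluence base URL and real paths.
for url in ("https://example.com/label/howto",
            "https://example.com/labels/listlabels-alphaview.action",
            "https://example.com/display/DOC/Home"):
    verdict = "blocked" if not parser.can_fetch("*", url) else "allowed"
    print(url, "->", verdict)

The first two URLs should print "blocked" and the last one "allowed"; if they do, the rules themselves are sound, and the problem is more likely in how the crawler fetches or interprets the file.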
