What is the best way of extracting data from confluence?

We use Confluence as a knowledge management application as it grows in size the content is becoming harder to manage. What is the best way to extract data such as:

Broken links

Pages with no incomming links

Pages containing links to external sites

What pages contain a link to a certain attachment/image

Map of a module or space (similar to BPMN)

User (what they clicked on within a time span)

Top page hits

Attachments (were is linked to, where is it displayed)

Any halp on this would be appreciated.

Thank you

2 answers

1 accepted

0 votes
Answer accepted

Many of the items in your list of data to extract can be answered by integrating Google Analytics within Confluence. This is pretty straightforward and has been documented several times, including:



Some info, such as the attachment info can be had by writing a user macro.



Hi Natalie,

some days ago I came across this blogpost: http://www.kikamaca.com/2012/03/managing-content-in-confluence-statistics-and-reporting/

Maybe this is useful for you.


This blog post no longer exists.

Suggest an answer

Log in or Sign up to answer
Community showcase
Published Mar 12, 2019 in Confluence

Confluence Admin Certification now $150 for Community Members

More and more people are building their careers with Atlassian, and we want you to be at the front of this wave! Important Dates Start the Certification Prep Course by 2 April 2019 Take your e...

292 views 2 12
Read article

Atlassian User Groups

Connect with like-minded Atlassian users at free events near you!

Find a group

Connect with like-minded Atlassian users at free events near you!

Find my local user group

Unfortunately there are no AUG chapters near you at the moment.

Start an AUG

You're one step closer to meeting fellow Atlassian users at your local meet up. Learn more about AUGs

Groups near you