How to find duplicate attachments for my Confluence instance?

Hi,

Could you please suggest how could I find all the duplicate attachments within my Confluence instance?

any specific SQL query to search it in database, as my query results includes versions also.

Thanks and Regards,

Jenin C M

2 answers

1 accepted

To exclude versions - get the latest version only, you should append prevver IS NULL in your WHERE statement.

If you could provide your SQL query, that would be easier for us to help tweaking it. :)

Hi Husein,

Thanks for your response!!

The query I am trying to fetch the duplicate attachments is:

select distinct title, pageid from attachments where attachmentid in (select distinct(a.attachmentid) from attachments a inner join attachments b on a.title = b.title and a.filesize=b.filesize and a.pageid=b.pageid where a.attachmentid <> b.attachmentid);

This query is generating a very large list of attachments (it includes all the versions) but I need only duplicated attachments i.e same pageid, filesize, content but different attachmentid.

Your response will be much appreciated.

Regards,

Jenin

 

You just want to add "count(*) as Kount", group by title, filesize, and "having kount > 0"

Suggest an answer

Log in or Sign up to answer
Community showcase
Posted Oct 11, 2018 in Confluence

What are your project planning tips?

Hello Community,  Jessica here from the Confluence product marketing team! Today I wanted to get your takes on project planning –– what works, what doesn’t, how do you know if you’re doing it r...

369 views 2 4
Join discussion

Atlassian User Groups

Connect with like-minded Atlassian users at free events near you!

Find a group

Connect with like-minded Atlassian users at free events near you!

Find my local user group

Unfortunately there are no AUG chapters near you at the moment.

Start an AUG

You're one step closer to meeting fellow Atlassian users at your local meet up. Learn more about AUGs

Groups near you