Create
cancel
Showing results for 
Search instead for 
Did you mean: 
Sign up Log in

Next challenges

Recent achievements

  • Global
  • Personal

Recognition

  • Give kudos
  • Received
  • Given

Leaderboard

  • Global

Trophy case

Kudos (beta program)

Kudos logo

You've been invited into the Kudos (beta program) private group. Chat with others in the program, or give feedback to Atlassian.

View group

It's not the same without you

Join the community to find out what other Atlassian users are discussing, debating and creating.

Atlassian Community Hero Image Collage

Is it possible to remove information from Page History versions via editing the database?

CSO is requiring us to remove plain text passwords from our Confluence pages and all historical versions of the pages.  We do not want to delete the versions and lose all additional historical information, so are wondering if the version history is accessible via the database?  If so, can we remove the password information directly from each version by editing the database directly?

Thanks. 

2 answers

1 accepted

0 votes
Answer accepted

You can do it in the database, but it's likely to be slow, painful and a huge amount of work.

You'll need some way to search content to identify where a password has been put in.  That's always going to be clunky, because there's no clean and simple way to do it that doesn't come down to "read each page and its history, looking for suspect lines"

However you do that, you'll need to end up with a list of pages and historical versions that contain the unwanted text.

Then you'll need to identify each line in the content table that you want to amend, then stop Confluence, go into the massive block of densely encoded xml and edit out the snippet of text you want to remove (without messing up the xml structure) for each entry, and then restart Confluence and re-index it before letting people back in.

This is not a minor undertaking, and SQL is probably the worst way to do it.  While "find" is a problem no matter what you do, I'd strongly recommend telling CSO that the only practical option is to simply destroy the history.  It's then up to them to decide if they want you to do many days of find and delete, or many months of find and edit.  

Or, better, get an automation or scripting tool that runs on the front end.  If you can identify the exact text, things like Scriptrunner for Confluence could do a lot of the work for you.

Everything is in the database as far as content goes, so yes you would be able to do it, question is how do you intent on doing that if you do not know the passwords upfront (i.e. how do you identify what is or isn't a password)?

The passwords are actually in the historical version of the pages, in plain text format, so someone would have to gather all passwords to be removed and the version that they exist on.

Suggest an answer

Log in or Sign up to answer
TAGS
Community showcase
Published in Confluence Cloud

Get to know the Confluence team!

Go “behind the screen” to meet some of the Confluence Cloud team. In this video series, we tackle some of the hard-hitting questions you never knew you wanted the answer to!  Meet some of the ...

236 views 0 10
Read article

Community Events

Connect with like-minded Atlassian users at free events near you!

Find an event

Connect with like-minded Atlassian users at free events near you!

Unfortunately there are no Community Events near you at the moment.

Host an event

You're one step closer to meeting fellow Atlassian users at your local event. Learn more about Community Events

Events near you