This question is in reference to Atlassian Documentation: Exporting Confluence Pages and Spaces to XML
(1) Is there any documentation (or even just a schema) for the exported XML? (2) Is it possible to do a batch export of the "storage format" instead?
There is no published schema. Macros storage format is published on each macro page, but standard page formatting is not published.
The XML export is the storage format. You can also get this using the remote REST API. https://developer.atlassian.com/confdev/confluence-server-rest-api
Thanks, Steven! I'll look into what I can do with the API.
In the meantime, do be aware that the XML export does not give you a clean copy of the storage format: the XML export contains all the versions of the page, lots of metadata, and a lot of indirection (to fish out the actual content of the latest version of a single page, I needed half a page of XQuery).
Another thing: the XML export puts all the content into CDATA sections (which are, moreover, malformed: they close with ]] > instead of ]]>).
The exported XML just doesn't seem suitable for further processing with XML tools. And that, if I can't find a workaround, is very likely to be a show-stopper for us.
(Said in an infomercial voice) Has this ever happened to you? You are on a Zoom/Teams call, looking at a large table, and people are confused where on the table you are pointing (insert video of real...
Connect with like-minded Atlassian users at free events near you!Find an event
Connect with like-minded Atlassian users at free events near you!
Unfortunately there are no Community Events near you at the moment.Host an event
You're one step closer to meeting fellow Atlassian users at your local event. Learn more about Community Events