You're on your way to the next level! Join the Kudos program to earn points and save your progress.
Level 1: Seed
25 / 150 points
Next: Root
1 badge earned
Challenges come and go, but your rewards stay with you. Do more to earn more!
What goes around comes around! Share the love by gifting kudos to your peers.
Keep earning points to reach the top of the leaderboard. It resets every quarter so you always have a chance!
Join now to unlock these features and more
The Atlassian Community can help you and your team get more value out of Atlassian products and practices.
I want to download all the 800k pages of my Confluence wiki.
I'd like to use:
curl -u wikiusername:wikipassword https://wiki.hostname.com/rest/api/content?start=1`
and simply increase start
from 1
to 800000
.
However, the response time increases as start
increases, and from ~80000
begins to timeout:
start | response time (seconds) |
---|---|
1 | 0.4 |
1,000 | 2.5 |
10,000 | 9 |
50,000 | 112 |
100,000 | timeout |
How can I use rest/api/content
to download all the 800k pages of my Confluence wiki without timing out?
This is because when you make the REST call, the server is trying to build the response in memory before it can send it back. If you make it too large, the process will fail.
You're going to need to page through what you're trying to download, you can't do it in one massive great chunk (unless you increased the server memory to something massive)
I'd also want to quickly question why? What is this download going to do for you? I'm thinking there may be a better option (like parsing a backup)
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.