I have several PowerPoint files that I'd like to upload to Confluence and I'd like Confluence to index the contents of the files. Some of the files are in the older ppt format and some of the files are in the newer pptx format. All of the ppt files index correctly in Confluence. Only the very smallest, single slide, pptx files will properly index in Confluence. I turned on debug logging and I'm seeing this:
Error reading content of PowerPoint document: Document too big for text extraction, bailing out
I'm familiar with the Attachment Size setting and these further settings:
I've tried tweaking those values and I always hit the same error above. Is there a way to tell Confluence to accept large pptx files?
I've been able to work-around the issue by saving my pptx files in ppt format, but this is not ideal.
Hello @Steve Boyle !
As I understand, your instance will not index any *.PPTX file you upload into it.
With this behavior in mind, there are a few things I would like to check with you:
- How big (in Megabytes or Kilobytes) are the *.PPT files that are indexed?
- How big (in Megabytes or Kilobytes) are the *.PPTX files that are not indexed?
- What happens if you save one of the files that are originally *.PPT as *.PPTX and try indexing again?
I am asking about file sizes so I can check with the responsible team if some of these limits also apply to *.PPTX files:
If the uploaded file is one of the following types, Confluence will only extract up to:
- 1 MB of text from Excel (.xlsx)
- 8 MB of text from PDF (.pdf)
- 10 MB of text from other text files (including .txt, .xml, .html, .rtf etc)
- 16 MB of text from Word (.docx)
Looking forward to your reply.
Thank you for your reply. We've been able to index small/simple PPTX files, under about 300KB. When we have PPTX files and then save them into PPT format, the PPT version will always index even when the PPTX version would not. We've been able to index PPT files up to around 10MB.
PPT files, indexed up to at least 10MB (Megabytes)
PPTX files, indexed up to around 300KB (Kilobytes)
If I have a PPTX file and it will not index then I can save it to PPT and it will index. If I take that same PPT file and save it back to PPTX then the PPTX will not index. Can't say I've tried only PPT->PPTX. PPTX->PPT works. PPTX->PPT->PPTX does not work.
Calling all Confluence Cloud Admins! We created a new Community Group to support your unique needs as Confluence admins. This is a group where you can ask questions, access resou...
Connect with like-minded Atlassian users at free events near you!Find an event
Connect with like-minded Atlassian users at free events near you!
Unfortunately there are no Community Events near you at the moment.Host an event
You're one step closer to meeting fellow Atlassian users at your local event. Learn more about Community Events