We are creating an ML model that needs the confluence data as training inputs. Please suggest what is the best way to load the confluence document data of my organization to the ML model. We also want to use Confluence data as a corpus of documents for RAG(Retreval Augmented Generation).
Is there any way to generate JSON formatted input files from Confluence pages?
@Disha Patel Welcome to the Atlassian community
It really depends how you need to feed the data. Every time I have done this in the past we have always used the Confluence API. This way we can consume the Confluence pages in a format that we can control. https://docs.atlassian.com/atlassian-confluence/REST/6.6.0/ This does provide you with JSON as well.
This helps. Thank you
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.