Forums

Articles
Create
cancel
Showing results for 
Search instead for 
Did you mean: 

How to read tables of the page, not the whole content

Pankaj Kumar
I'm New Here
I'm New Here
Those new to the Atlassian Community have posted less than three times. Give them a warm welcome!
May 20, 2022

Screenshot 2022-05-21 at 12.37.57 AM.png

API I am using is like -> https://my_domain.atlassian.net/wiki/spaces/MT/pages/123/mytesttable | python -mjson.tool

I have an example, like above on the confluence page. By using API I can read this page. But the HTML returned from the confluence API is too large and the whole HTML page. I want to read only the table (or tables) of the page. The response can be HTML or JSON, I am fine with both. 

Could you please help. 

1 answer

1 vote
Prince Nyeche
Rising Star
Rising Star
Rising Stars are recognized for providing high-quality answers to other users. Rising Stars receive a certificate of achievement and are on the path to becoming Community Leaders.
May 20, 2022

You need to extract the content of the HTML page. This is not something you would easily do on a command line in shell. You will need to properly script a solution using a HTML parser (probably in python) and get the content of each column within the confluence page content of that table. Then you can translate the output to whatever format you prefer.

Suggest an answer

Log in or Sign up to answer
TAGS
AUG Leaders

Atlassian Community Events