Community
Q&A
Confluence
Questions
How to scrape the pages requiring login?

How to scrape the pages requiring login?

I am trying to crawl through the wiki pages but the authentication required is not allowing me to get the form field names or anything. I was wondering if there is a better way to do this. Thanks!

1 answer

0 votes

If the web interface wants you to authenticate (to login), as that page does not allow anonymous access, then your web scraper also needs to authenticate (as that is essentially doing the same thing: getting an HTTP response about that Confluence page).

I suggest you follow the Confluence REST API way, but even in that case: you need to authenticate.

You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.

A pop-up survey could appear while you're here --curious what it's for? Click here to learn more!

Forums

Q&A

Community resources

Support

Top groups

Community resources

Support

Learn

Community resources

Support

Events

Community resources

Support

How to scrape the pages requiring login?

1 answer

Suggest an answer

Was this helpful?

Thanks!

TAGS

Atlassian Community Events