Howto import pages from web into confluence?

EDIT/

Maybe it could be possible to take the "browser-bookmarklet" (javascript-code) from the bookmark-plugin and enhance it so that all selected content is copied into a new page in confluence (under a selectable page) - not just text (like the existing functionality...)

No developer or 3rd party company out there to create such a nice plugin?

EDIT/

Hello,

every collegue is "finding" regularly interesting informations in the web. But how to get them "with a click" to the correct page in confluence?

I've seen

{cache}+{html}-Macro: Problem: Web page gets parsed/updated regularly, creating much load, Pictures are only linked, not embedded

{html}-Macro: Problems: Pictures only embedded, creating new request on every consultation

"HTML to confluence converter"
http://confluence.atlassian.com/display/DISC/HTML+To+Confluence+Converter

Problem: Looks like a promising start, but pictures are not embedded, needs an php-webserver...

Best solution for the moment: I press "Ctrl+P" in the browser and export the page to pdf. This pdf I attach to a confluence-page. (it gets indexed, all formatting is preserved, not so bad)

But what I dream about is that directly in the browser I mark which part of the page I want to import into confluence and then (like with the bookmarks-bookmarklet) I press a button, select the space/page in confluence and all the text gets inserted, links are preserved and pictures are embedded. So that the content is fully available in confluence without generating any load on consultations...

Any idea how to achieve this?

4 answers

1 accepted

0 votes
Accepted answer

Finally I've realized this myself. I've coded a little SOAP-Program in .NET which takes the files from (FireFox -> Save page), attaches the pictures to a newly created page, inserts the HTML-Code from the page (.html) into the confluence page and rewrites all images links so that they point to the attached images.

Was a straightforward process ;-)

Hello Svenni,

this is great to hear, I was searching for hours for a Plugin u seem to have developed.
I also have to import HTML files attached with lot of images to confluence (Typo3 -> Confluence)

It seems that you have a solution for the time intentsive copy paste method.

Is it possible that you make this plugin public or send it to me by mail ( schlom@intuity.de )

It would be awesome.

best regards

Stefan

unfortunately it is part of a bigger program (outlook-plugin) and our policy does not allow to share it.

But as indicated the solution is straightforward (a developper can do it in some hours)

{iframe:width=1000|height=900|src=http://www.atlassian.com}{iframe}

I made the above into a template called iFrames.

You can manage the web by setting White Pages in the Confluence Admin - allowing only certain websites to displayed inside Confluence.

Thanks for the answer, I'm searching a way to copy content (including images!).

I don't want that every time somebody reads the information in the wiki, the content is loaded again from the original server. Looks like "print to pdf" in the browser and then attaching the pdf to the page is for the moment the best way... (if I see new services like "www.clipboard.com" there should be better ways to handle this...

Hallo Svenni

Can you convert your content to Word documents? If so, then you may have a good result by importing the Word documents into Confluence. This blog post may be useful:

http://blogs.atlassian.com/2010/11/technical_writing_wiki_single_source_publishing/

It's a pity that there isn't a reliable way of converting directly from HTML. That's a gap that really does need filling. :)

I hope this helps.

Cheers, Sarah

Sarah, thanks for the suggestion. We are talking here about "ordinary webpages", blogs, etc. Just content which is worth to be "completely integrated" into the wiki...

Yes the "word-bridge" could be an idea... Or perhaps a Browser-Plugin which allows to select which parts to captures and where to store in confluence and then automatically exports this part of the page and imports it (by SOAP or JScript-JSON or whatelse method) into confluence...

Just dreaming ;-)

Hmm. I did find nothing just some utilities to strip out unwanted parts of a webpage (before printing):

https://addons.mozilla.org/de/firefox/addon/print-edit/

http://www.printwhatyoulike.com/

Now all we need it is "save to word"-printer driver which then automatically uploads the generated word-document into confluence...

That costs $$$ and is only for internet explorer but sounds good, too:

http://www.readonweb.com/guide/62.htm

(clean web page from clutter and edit "main content" directly in word...)

http://www.interclasse.com/scripts/html2word.php

With some scripting-magic (and using Internet Explorer scripting and Microsoft Word Scripting) it is even possible to automate the task. With the Confluence CLI we could then do the auto-import of the created word-file. But... In this way all "clutter" (publicity, navigation, links...) would be imported into confluence, too.

We have to be able to select first which part of the page we want to copy...

Suggest an answer

Log in or Sign up to answer
Community showcase
Posted Oct 24, 2018 in Confluence

Atlassian Research opportunity with Confluence templates

Do you use templates with Confluence? Take part in a remote 1-hr workshop. You'll receive USD $100 for your time!   We're looking for people to participate in a   remote 1-hr workshop...

1,577 views 26 14
Join discussion

Atlassian User Groups

Connect with like-minded Atlassian users at free events near you!

Find a group

Connect with like-minded Atlassian users at free events near you!

Find my local user group

Unfortunately there are no AUG chapters near you at the moment.

Start an AUG

You're one step closer to meeting fellow Atlassian users at your local meet up. Learn more about AUGs

Groups near you