How to import pages from the web into Confluence?

Sven
Rising Star
Rising Stars are recognized for providing high-quality answers to other users. Rising Stars receive a certificate of achievement and are on the path to becoming Community Leaders.
November 22, 2011

EDIT/

Maybe it would be possible to take the browser bookmarklet (JavaScript code) from the bookmark plugin and enhance it so that all selected content is copied into a new page in Confluence (under a selectable page), not just the text as with the existing functionality...

No developer or 3rd party company out there to create such a nice plugin?

EDIT/

Hello,

Every colleague regularly "finds" interesting information on the web. But how do we get it "with a click" onto the correct page in Confluence?

I've seen

{cache}+{html} macro. Problem: the web page gets parsed/updated regularly, creating a lot of load, and pictures are only linked, not embedded.

{html} macro. Problem: pictures are only embedded from the original site, creating a new request on every page view.

"HTML to confluence converter"
http://confluence.atlassian.com/display/DISC/HTML+To+Confluence+Converter

Problem: Looks like a promising start, but pictures are not embedded and it needs a PHP web server...

Best solution for the moment: I press "Ctrl+P" in the browser and export the page to PDF, then attach this PDF to a Confluence page. (It gets indexed and all formatting is preserved, so not so bad.)

But what I dream of is marking, directly in the browser, the part of the page I want to import into Confluence, then (as with the bookmarks bookmarklet) pressing a button and selecting the space/page in Confluence, so that all the text gets inserted, links are preserved and pictures are embedded. The content would then be fully available in Confluence without generating any load when it is viewed...

Any idea how to achieve this?

4 answers

1 accepted

0 votes
Answer accepted
Sven
February 16, 2012

In the end I implemented this myself. I wrote a small SOAP program in .NET that takes the files from Firefox ("Save page"), attaches the pictures to a newly created page, inserts the HTML code from the saved page (.html) into the Confluence page and rewrites all image links so that they point to the attached images.

Was a straightforward process ;-)
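
The .NET program itself was not shared, so the following is only a minimal Python sketch of the link-rewriting step described above, not the original implementation. It assumes that locally saved images sit next to the .html file (for example in a `page_files/` folder) and that attachments end up under a URL prefix such as `/download/attachments/<pageId>`; both the function name `rewrite_image_links` and that prefix are illustrative assumptions. Creating the page and uploading the attachments is left out.

```python
import posixpath
import re
from urllib.parse import quote, unquote

# Matches the src attribute of an <img> tag, single- or double-quoted.
IMG_SRC = re.compile(
    r'(<img\b[^>]*?\ssrc\s*=\s*)(["\'])(.*?)\2',
    re.IGNORECASE | re.DOTALL,
)
# Anything with a URL scheme (http:, data:, ...) or starting with // is remote.
REMOTE = re.compile(r'^(?:[a-z][a-z0-9+.-]*:|//)', re.IGNORECASE)


def rewrite_image_links(html, attachment_base):
    """Point local <img> sources at attachment URLs.

    Returns (new_html, local_files), where local_files lists the original
    relative paths that would have to be uploaded as attachments.
    attachment_base is an assumed URL prefix, e.g. "/download/attachments/12345".
    """
    local_files = []

    def replace(match):
        prefix, quote_char, src = match.groups()
        if REMOTE.match(src):
            return match.group(0)  # remote image: leave untouched
        name = posixpath.basename(unquote(src))
        if not name:
            return match.group(0)  # nothing usable as a file name
        if src not in local_files:
            local_files.append(src)
        new_src = f"{attachment_base}/{quote(name)}"
        return f"{prefix}{quote_char}{new_src}{quote_char}"

    return IMG_SRC.sub(replace, html), local_files
```

Calling `rewrite_image_links(saved_html, "/download/attachments/12345")` returns the rewritten HTML together with the list of image files to attach. A regular expression is enough for a sketch like this; a real HTML parser would be more robust against unusual markup.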

Stefan Schlom February 22, 2012

Hello Svenni,

this is great to hear. I was searching for hours for a plugin like the one you seem to have developed.
I also have to import HTML files with lots of attached images into Confluence (Typo3 -> Confluence).

It seems that you have a solution that replaces the time-intensive copy-and-paste method.

Would it be possible to make this plugin public or send it to me by mail ( schlom@intuity.de )?

It would be awesome.

best regards

Stefan

Sven
February 27, 2012

Unfortunately it is part of a bigger program (an Outlook plugin) and our policy does not allow us to share it.

But as indicated, the solution is straightforward (a developer can do it in a few hours).

0 votes
Alexander Polonsky July 10, 2012

Check out Mediamark, the Multimedia Social Bookmarking plugin: https://marketplace.atlassian.com/plugins/com.cogniumsystems.confluence.plugins.mediamark .

0 votes
SarahA
December 14, 2011

Hallo Svenni

Can you convert your content to Word documents? If so, then you may have a good result by importing the Word documents into Confluence. This blog post may be useful:

http://blogs.atlassian.com/2010/11/technical_writing_wiki_single_source_publishing/

It's a pity that there isn't a reliable way of converting directly from HTML. That's a gap that really does need filling. :)

I hope this helps.

Cheers, Sarah

Sven
December 14, 2011

Sarah, thanks for the suggestion. We are talking here about "ordinary web pages", blogs, etc. Just content that is worth being "completely integrated" into the wiki...

Yes, the "Word bridge" could be an idea... Or perhaps a browser plugin that lets you select which parts to capture and where to store them in Confluence, and then automatically exports that part of the page and imports it (by SOAP, JScript/JSON or some other method) into Confluence...

Just dreaming ;-)

Sven
December 14, 2011

Hmm. I found nothing except some utilities to strip out unwanted parts of a web page (before printing):

https://addons.mozilla.org/de/firefox/addon/print-edit/

http://www.printwhatyoulike.com/

Now all we need is a "save to Word" printer driver that automatically uploads the generated Word document into Confluence...

Sven
December 14, 2011

That costs $$$ and is only for Internet Explorer, but sounds good, too:

http://www.readonweb.com/guide/62.htm

(cleans the web page of clutter and lets you edit the "main content" directly in Word...)

Sven
December 14, 2011

http://www.interclasse.com/scripts/html2word.php

With some scripting magic (using Internet Explorer scripting and Microsoft Word scripting) it is even possible to automate the task. With the Confluence CLI we could then auto-import the created Word file. But... this way all the "clutter" (advertising, navigation, links...) would be imported into Confluence, too.

We first have to be able to select which part of the page we want to copy...
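
As a rough illustration of that selection step, here is a small Python sketch (standard library only) that keeps just one element of a saved page, chosen by its `id` attribute, and drops everything around it. The class name `FragmentExtractor`, the helper `extract_fragment` and the idea of selecting by `id` are assumptions made for the example; an interactive selection in the browser would need a different mechanism. It also assumes reasonably well-formed markup in which non-void elements are explicitly closed.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag.
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}


class FragmentExtractor(HTMLParser):
    """Collects the markup of the element with the given id (hypothetical helper)."""

    def __init__(self, element_id):
        super().__init__(convert_charrefs=False)
        self.element_id = element_id
        self.depth = 0      # nesting depth inside the selected element, 0 = outside
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if self.depth == 0 and dict(attrs).get("id") != self.element_id:
            return          # still outside the wanted element
        self.parts.append(self.get_starttag_text())
        if tag not in VOID:
            self.depth += 1

    def handle_startendtag(self, tag, attrs):
        if self.depth:      # self-closing tag such as <br/>
            self.parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.depth and tag not in VOID:
            self.parts.append(f"</{tag}>")
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.parts.append(data)

    def handle_entityref(self, name):
        if self.depth:
            self.parts.append(f"&{name};")

    def handle_charref(self, name):
        if self.depth:
            self.parts.append(f"&#{name};")


def extract_fragment(html, element_id):
    """Return the markup of the element with id=element_id, or "" if not found."""
    parser = FragmentExtractor(element_id)
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)
```

For a page whose article body sits in something like `<div id="main">`, `extract_fragment(page_html, "main")` would return only that block, so navigation and advertising never reach the wiki.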

0 votes
Ole Kristensen November 23, 2011

{iframe:width=1000|height=900|src=http://www.atlassian.com}{iframe}

I made the above into a template called iFrames.

You can manage the web by setting White Pages in the Confluence Admin, allowing only certain websites to be displayed inside Confluence.

Sven
December 13, 2011

Thanks for the answer, but I'm looking for a way to copy content (including images!).

I don't want the content to be loaded again from the original server every time somebody reads the information in the wiki. It looks like "print to PDF" in the browser and then attaching the PDF to the page is the best way for the moment... (Seeing new services like "www.clipboard.com", there should be better ways to handle this...)
