Create
cancel
Showing results for 
Search instead for 
Did you mean: 
Sign up Log in

Data Center Love Month: OpenSearch for Confluence

Anubhav Dutt
Atlassian Team
Atlassian Team members are employees working across the company in a wide variety of roles.
August 30, 2024

Hello Data Center Community,

Happy Data Center Love Month!

I’m Anubhav, a product manager in Atlassian Data Center and I am excited to announce a new enhancement in Confluence designed to address the challenges of slow search performance, particularly with increase in usage and growth in instance size.

Confluence uses Lucene's search library for powering search as default. Lucene, while a robust low-level search library, is not inherently designed to be scale-resilient. This limitation of Lucene becomes more apparent in large instances.

Enter: OpenSearch

OpenSearch is a popular enterprise-grade dedicated search platform that creates a powerful search capability based on distributed processing architecture backed by the robustness of Lucene’s search library.

With the latest Confluence Data Center 9.0 release we now support OpenSearch if you opt-in to switch from the default Lucene based search implementation.

Search in Confluence is now faster:

Confluence_Search_Lucene.gif

 

Our internal performance benchmarking results show OpenSearch is ~4.5x faster than a comparable Lucene-based search and this performance gain only widens with large dataset.

The results:

image-20240715-233943.png

We tested this on an internal 3 node Confluence instance for benchmarking a 36GB index using Lucene(8 CPU, 32GB RAM) against OpenSearch(2 CPU, 16 GB RAM).

Its not just search that benefits from this feature, below are several reasons for why we believe this is a game changer when it comes to overall Confluence performance experience for our large enterprise customers:

  • Better performance - Search is faster and other functions that depend on Lucene access for e.g. page moves, may experience gains

  • Scale with confidence - As your organization grows in size and scale, take advantage of OpenSearch’s horizontal scaling capabilities and let Confluence grow with you.

  • Reduced overhead - Simplified index management lowers overall Confluence resource consumption.

  • High availability - Search without interruption on clustered OpenSearch instances during reindexing.

Our exploration and thorough tests have confirmed that OpenSearch can significantly improve scale related performance issues, however we also understand that switching to OpenSearch may not be immediate for some customers.

OpenSearch is not the default search platform for Confluence.

To begin to use it, you will need to setup and configure it as first-step.

Visit OpenSearch configuration guide containing links to OpenSearch resources.

Search is a core function for any enterprise knowledge management solution and the wide adoption of Confluence has meant that we keep enhancing and improving the Confluence Search features for our customers. In the recent past we introduced space re-index with Confluence DC 8.4 to ensure users can promptly access the most up-to-date content and modifications in real-time fostering a seamless and uninterrupted collaboration experience among teams.

For more details on OpenSearch refer to Confluence DC 9.0 release notes.

To enter for your chance to win Data Center exclusive swag*, leave us a comment or question in the comment section below.

*At this time, swag is available for US shipping only. Winners will be notified at the end of the month.

4 comments

Comment

Log in or Sign up to comment
Gonchik Tsymzhitov
Community Leader
Community Leader
Community Leaders are connectors, ambassadors, and mentors. On the online community, they serve as thought leaders, product experts, and moderators.
August 30, 2024

To be honest OpenSearch looks very promised and provided in AWS :) 

Like # people like this
Eddy G
Contributor
August 31, 2024

Very much looking forward to improved search performance in Confluence. 🙂

Quoting from the OpenSearch docs:

There are many ways to design a cluster. The following illustration shows a basic architecture that includes a four-node cluster that has one dedicated cluster manager node, one dedicated coordinating node, and two data nodes that are cluster manager eligible and also used for ingesting data.

Will Atlassian be providing any recommendations (or at least an example or two) for how to size and configure an OpenSearch cluster (in terms of CPU, memory, disk space) for ”good” performance based on the size of the Confluence instance? (Especially for customers running on-prem and not AWS?)

Thanks!

 

Like # people like this
Stephen Garber
Rising Star
Rising Star
Rising Stars are recognized for providing high-quality answers to other users. Rising Stars receive a certificate of achievement and are on the path to becoming Community Leaders.
September 2, 2024

This is great, search has been a pain point for confluence for a while. It can be slow and produce low quality results.

Moving pages in bulk can also take a while.

It's a great idea to move searching off the main instance!

Like # people like this
Anubhav Dutt
Atlassian Team
Atlassian Team members are employees working across the company in a wide variety of roles.
September 2, 2024

Thanks @Eddy G  for your comment and sharing your excitement !!

 

Regarding the size and configuration we have tried to address that here.

However this is based on Amazon AWS setup that we tested with primarily. My suggestion would be to setup a similar configuration on-premise at the minimum and scale up or down depending on specific usage.

 

Hoping that helps.

Anubhav Dutt

PM - Atlassian DC

Eddy G
Contributor
September 5, 2024

I'm not sure showing managed OpenSearch nodes really helps folks running on-prem. Those instances, and their OpenSearch settings, are all preconfigured by Amazon.

It would be much more helpful to show the details of each node for several OpenSearch configuration options (based on the number of users, size of the Confluence instance, etc.) when OpenSearch is run in an on-prem Kubernetes cluster, for example.

Kristin Lyons
Rising Star
Rising Star
Rising Stars are recognized for providing high-quality answers to other users. Rising Stars receive a certificate of achievement and are on the path to becoming Community Leaders.
September 3, 2024

Reduced overhead - Simplified index management lowers overall Confluence resource consumption.

I love this, we are always looking for ways to ensure we are scaling both Jira and Confluence properly so having reduced overhead is huge for us!

Like # people like this
Dave Liao
Community Leader
Community Leader
Community Leaders are connectors, ambassadors, and mentors. On the online community, they serve as thought leaders, product experts, and moderators.
September 6, 2024

I’m here for the swag!

Just kidding, I’m actually curious about how much OpenSearch can help my users… 🔎

TAGS
AUG Leaders

Atlassian Community Events