Wrong search results with Chinese labels

Hi everybody,

One of our customers is experiencing a weird search issue in their Chinese department. I broke the issue down to the most simple case: one page has the label 其他产品 (other products) and one has the label 产品信息 (product information). When I do a label search like labelText:其他产品 both pages are found:

screen-pocketsearch.png

Does anyone have a clue why this happens?

Regards, Felix [Scandio]

2 answers

1 accepted

1 vote
Stephen Deutsch Community Champion Aug 04, 2015

Hi Felix,

Taking a look at the documentation for the tokenizer for Lucene that deals with CJK characters, it seems like it splits up the characters into two-character bundles:

https://lucene.apache.org/core/3_5_0/api/all/org/apache/lucene/analysis/cjk/CJKTokenizer.html

That would explain why it matches the two characters for "product" in both strings.

Hi Stephen,

Yes, we already solved this issue with Atlassian support. It actually works if you put the strings in double quotes. I'll accept your answer wink.

Regards, Felix

Suggest an answer

Log in or Sign up to answer
How to earn badges on the Atlassian Community

How to earn badges on the Atlassian Community

Badges are a great way to show off community activity, whether you’re a newbie or a Champion.

Learn more
Community showcase
Posted Jul 10, 2018 in Confluence

We want to see the templates you've created in Confluence!

Hi Community, Jessica here from the Confluence Product Marketing team!  July’s community challenge is all about sharing pictures  — and as an extension of our first post on what ...

913 views 23 12
Join discussion

Atlassian User Groups

Connect with like-minded Atlassian users at free events near you!

Find a group

Connect with like-minded Atlassian users at free events near you!

Find my local user group

Unfortunately there are no AUG chapters near you at the moment.

Start an AUG

You're one step closer to meeting fellow Atlassian users at your local meet up. Learn more about AUGs

Groups near you