Does anybody know what happens if you try to input a character not defined in UTF-8 into JIRA?

Daisuke Niwa December 21, 2011

Hello there,

Does anyone know what happens if you enter a character that is not defined in UTF-8 into JIRA, say in an issue description or issue summary?

Does that letter appear garbled?

3 answers

1 accepted

1 vote
Answer accepted
NielsJ
Rising Star
Rising Stars are recognized for providing high-quality answers to other users. Rising Stars receive a certificate of achievement and are on the path to becoming Community Leaders.
December 21, 2011

UTF-8 is an encoding for the Unicode character set. As of Unicode 6.0, Unicode contains about 109 thousand characters from 93 scripts. It is very likely that your character is covered by Unicode, too ;-)

To find your character you can use this page:
http://www.unicode.org/standard/where/

How to enter Unicode characters is explained at Wikipedia:
http://en.wikipedia.org/wiki/Unicode_input
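
To make the character set versus encoding distinction concrete, here is a minimal Python sketch. It only shows standard-library behaviour and says nothing about how JIRA itself stores text; the helper name `describe` is made up for illustration.

```python
def describe(ch: str) -> dict:
    """Return the Unicode code point and the UTF-8 bytes of one character."""
    encoded = ch.encode("utf-8")  # UTF-8 maps a code point to 1-4 bytes
    return {
        "code_point": f"U+{ord(ch):04X}",  # position in the Unicode character set
        "utf8_hex": " ".join(f"{b:02x}" for b in encoded),
    }


if __name__ == "__main__":
    # "\u00e9" is e with acute accent, "\u65e5" is the kanji for "day".
    for ch in ("A", "\u00e9", "\u65e5"):
        print(ch, describe(ch))
```

The code point identifies the character in Unicode; the byte sequence is just how UTF-8 writes that code point down. Any character that has a code point can be encoded this way, so "not defined in UTF-8" in practice means "not in Unicode".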

Daisuke Niwa December 21, 2011

Hello Jamie and Niels,

Thank you for your respective answers. You are right, UTF-8 is not a character set, but an encoding.

So I should have asked "what happens if you input a character not defined in Unicode?" instead.

As Niels pointed out, it is unlikely to happen, since Unicode is such a big character set, but my customer is worried about that hypothetical possibility.

I'd appreciate it if you could give input on this scenario.

Best regards,

Daisuke Niwa

JamieA
Rising Star
December 21, 2011

Well that's why I asked what character you were thinking of... it's so unlikely as to be a hypothetical question only.

Daisuke Niwa December 22, 2011

Hello Jamie,

Understood. Let me confirm with the customer.

Regards,

Daisuke Niwa

1 vote
JamieA
Rising Star
December 21, 2011

UTF-8 is an encoding, not a character set. Do you mean entering a character that is not in the Unicode character set? If so, which one?

0 votes
Andy Brook [Plugin People]
Rising Star
December 22, 2011

There are many private-use code points in Unicode that one party can use arbitrarily but that may mean something else to another party, so the parties would need to agree on their meaning for it to be transferable. I have to say I've never come across this, and Unicode is generally the best there is at character representation today.

- http://en.wikipedia.org/wiki/Unicode

- http://en.wikipedia.org/wiki/Private_use_characters
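
If the customer wants to check text for such code points before it goes into an issue, something like the following Python sketch could serve as a starting point. It relies on the general categories reported by the standard `unicodedata` module (`Co` for private use, `Cn` for unassigned, `Cs` for surrogates); the function names `classify` and `find_unusual` are hypothetical, and what counts as unassigned depends on the Unicode version bundled with the Python build. It is not a description of what JIRA does internally.

```python
import unicodedata


def classify(ch: str) -> str:
    """Classify one character by its Unicode general category."""
    category = unicodedata.category(ch)
    if category == "Co":
        return "private-use"   # meaning is by private agreement only
    if category == "Cn":
        return "unassigned"    # no character defined at this code point
    if category == "Cs":
        return "surrogate"     # not a character on its own
    return "assigned"


def find_unusual(text: str) -> list:
    """List (index, code point, class) for every non-assigned character."""
    hits = []
    for index, ch in enumerate(text):
        kind = classify(ch)
        if kind != "assigned":
            hits.append((index, f"U+{ord(ch):04X}", kind))
    return hits


if __name__ == "__main__":
    sample = "summary \ue000 text"  # U+E000 is in the Private Use Area
    print(find_unusual(sample))
```

An empty result means every character in the text has an assigned meaning in the Unicode data the interpreter ships with.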
