Difference between Collator (locale-sensitive) and compareTo (lexicographically) for comparing String values

StackOverflow https://stackoverflow.com/questions/8885885

  •  29-10-2019
  •  | 
  •  

Question

I've been reading about using Collator and the compareTo method in String for comparing Strings. I'm unsure what the real difference is between the two from reading the API. When is one to prefer over the other?

API Collator

API String compareTo

Was it helpful?

Solution

Promoted from my comment (which sort of half answers the question):

Use a collator: Suppose you have a contact manager for a company that has international locations. Suppose you have an autocomplete with prefix matching. The collator can allow your US employees to find a match on accented vowels in names without typing in the accents.

Use compareTo: when you don't care about situations like I just gave.

OTHER TIPS

Basically, locale-sensitive means that it takes into account the language being used and may use different weights for comparisons between different characters.

"For example, in Czech, "e" and "f" are considered primary differences, while "e" and "ě" are secondary differences, "e" and "E" are tertiary differences and "e" and "e" are identical." 1

With the lexicographical comparison of compareTo it just uses their Unicode values instead of taking these different weights into account.

"For comparing Strings exactly once, the compare method provides the best performance. When sorting a list of Strings however, it is generally necessary to compare each String multiple times. In this case, CollationKeys provide better performance. The CollationKey class converts a String to a series of bits that can be compared bitwise against other CollationKeys. A CollationKey is created by a Collator object for a given String. "1

1 Colator Javadoc

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top