Though antoyo's answer works, I wasn't too sure why. So, I decided to investigate.
All of my documents are encoded in UTF-8, as are most web-pages. The ß
character has the UTF code point of UTF+00DF.
Since UTF-8 is a variable length encoding, in the binary form, ß
would be encoded as 11000011 10011111
or C3 9F
. Since by default Qt relies on Latin1 encoding. It would read ß
as two different characters. The first one C3
will map to Ã
and the second one 9F
will not map to anything as Latin1 does not recognize bytes in between 128-159 (in decimal).
That's why ß
appears as Ã
when using Latin1 encoding.
Side note: You might want to brush up on how UTF-8 encoding works, because otherwise it seems a little unintuitive that ß
takes two bytes even though its code point DF
is less than FF
and should consume just one byte.