Mapping Windows-1252 encoding to UTF8 in C
-
14-11-2019 - |
Question
How do you programmatically convert a C-string with encoding Windows-1252 (a.k.a. CP1252) to UTF8?
Solution
On Windows? First call MultiByteToWideChar
and then WideCharToMultiByte
.
On a Mac or Linux: call iconv_open
and then iconv
as needed.
In general: incorporate the ICU4C library.
OTHER TIPS
You need to use a code page conversion, which is language/paltform specific.
If you want to implement it: Windows-1252 on Wikipedia (there is a link to the conversion table)
Some solutions:
- Windows: MultiByteToWideChar
- python: decode()
- Linux: libiconv (just one of the many)
etc...
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow