Question

I'm downloading HTML source code of remote page into String variable. Unfortunetely the page is encoded via iso-8859-2 and contains characters from polish alphabet. How can I convert this string to utf-8, so I can display it's parts in TextView?

Thanks

Was it helpful?

Solution

You shouldn't need to "convert" the string at all, if you obey the Content-Encoding header sent by the web server.

Right now, you probably ignore that header while reading the response from the server (some BufferedReader-to-StringBuffer/Builder loop I assume), try this in your download code instead:

HttpResponse response = ....
String text = EntityUtils.toString(response.getEntity());

EntityUtils will automagically use the content encoding specified by the server.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top