Use the html.unescape() function (Python 3.4 and newer):
>>> import html
>>> html.unescape('&#1086;&#1087;&#1072;')
'опа'
On older versions (including Python 2), you’d have to use an instance of HTMLParser.HTMLParser():
>>> from HTMLParser import HTMLParser
>>> parser = HTMLParser()
>>> parser.unescape('&#1086;&#1087;&#1072;')
u'\u043e\u043f\u0430'
>>> print parser.unescape('&#1086;&#1087;&#1072;')
опа
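
If the same code has to run on both Python 2 and Python 3.4+, the two approaches above can be combined behind one name. This is only a minimal sketch: the helper name decode_entities is hypothetical, and it assumes the interpreter is either Python 2 or Python 3.4 and newer (it does not cover Python 3.0 to 3.3).

```python
try:
    # Python 3.4+: unescape() is a function in the standard html module.
    from html import unescape
except ImportError:
    # Python 2 fallback: borrow the unescape() method of a parser instance.
    from HTMLParser import HTMLParser
    unescape = HTMLParser().unescape


def decode_entities(text):
    """Hypothetical helper: replace HTML character references in text."""
    return unescape(text)


# Numeric and named references are both handled.
cyrillic = decode_entities('&#1086;&#1087;&#1072;')
tag = decode_entities('&lt;b&gt;')
```

Numeric references such as &#1086; and named ones such as &lt; both go through the same call, so the caller does not need to know which form the input uses.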