The split() method splits on whitespace by default. Therefore, strlist is a list with a single element: strlist[0] contains the whole string.
If you want a list with one element for each Unicode code point, you can convert the string into a list in several ways:
- Function: list(stru.decode("utf-8"))
- List comprehension: [item for item in stru.decode("utf-8")]
- Don't convert at all. Do you really need a list? You can iterate over the unicode string just like over any other sequence type (for character in stru.decode("utf-8"): ...)
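
To see the three options side by side, here is a small sketch. It assumes stru is a UTF-8 encoded byte string; the sample value and the names text, chars_from_list, chars_from_comprehension and chars_from_loop are made up for illustration and are not from your code.

```python
# Hypothetical sample: UTF-8 bytes for "a", n-with-tilde, "b", euro sign.
# It contains no whitespace and is 7 bytes long but only 4 code points.
stru = b"a\xc3\xb1b\xe2\x82\xac"

# split() with no arguments splits on whitespace, so there is only one piece.
strlist = stru.split()

# Decode once, then build a list with one element per code point.
text = stru.decode("utf-8")
chars_from_list = list(text)
chars_from_comprehension = [item for item in text]

# Or skip the list and iterate over the decoded string directly.
chars_from_loop = []
for character in text:
    chars_from_loop.append(character)
```

All three approaches should give the same four characters, whereas strlist still holds the single undecoded string.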