converting ms word document's special characters to html
-
11-06-2021 - |
문제
I have word document and following code which is converting doc into html using Apache POI Api.
serializer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
serializer.setOutputProperty(OutputKeys.METHOD, "html");
serializer.transform(domSource, streamResult);
but the numbering i.e 1), a), i) etc. and bullet points characters are not being parsed correctly, I am getting garbage characters like 1? and when I open the html file in the editor I get numbers with unwanted boxes. I have tried a lot but I don't get proper solution of this.
Please help me out in order to get rid of this encoding issue.
Thanks
올바른 솔루션이 없습니다
제휴하지 않습니다 StackOverflow