Is there anything to convert HTML special characters in files to normal characters?
-
11-09-2019 - |
Question
I have some source code files which came to me by an HTML output, so they're pretty unusable.
I have things like this:
%include "macros.mac"
Which should be:
%include "macros.mac"
Is there any script (sh, perl, batch, ...) to convert every file (there are about 200) to the appropriate characters? Characters include & lt;
, & gt;
, ... (I put an space in the middle so that it wont convert them to < and >.
Thank you, it's very appreciated.
Solution
If it is just about the four &"<>
characters, sed(1)
could help:
sed 's/"/"/g; s/</</g; s/>/>/g; s/&/\&/g'
Update: My original proposal was the following and had a bug:
sed 's/&/\&/g; s/"/"/g; s/</</g; s/>/>/g'
This would convert "&gt;" into ">" which is wrong.
OTHER TIPS
you can try a tool like windows grep or textcrawler for this
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow