Question

Can anyone share experience with parsing XML chunks embedded in a non-XML file?

I am implementing an Edge-Side-Includes[1] processor. Edge-Side-Includes elements are not necessarily embedded in XML- or well-formed XML files and this poses the question, how to go about finding and then parsing such elements.

Has anyone done something similar?

[1] http://www.w3.org/TR/esi-lang

Was it helpful?

Solution

Seems like the best option is to either embed the XML tokenizing into the overall tokenizer or identify the chunks and hand them to an XML parser individually.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top