Vra

so far it's only aspose words but which is very pricey

other are to convert to .pdf or to print to .pdf

I am looking for a way to read the contents of these doc types without installing office or pdf app i.e. get the text of these documents for parsing

Was dit nuttig?

Oplossing

DevExpress offers a document server component now, which is far less pricey than Aspose.

Ander wenke

You want to use components that plug into the IFilter framework, which is what windows uses to index documents for its text search.

For office documents you can use Office 2010 Filter Pack For pdf, you can use a commercial offering such as FoxIt IFilter, which seems fairly priced.

Gelisensieer onder: CC-BY-SA met toeskrywing
Nie verbonde aan StackOverflow
scroll top