Question

i want to parse pdf file such that i can identify each article's title and its description.so that i can store title text and descriptiontext in cache for search purpose.is there any library or tool for this?

Was it helpful?

Solution

First you can install Adobe Pdf IFilter dll. Then you can read properties from any .pdf file using this library (Using IFilter in C#). Or alternatively you can also look into this library (Implementing a TextReader to extract various files contents using IFilter).

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top