dot net:i need to parse pdf file to get each article with its title?

https://stackoverflow.com/questions/1058208

.net
pdf
parsing
title
ifilter

21-08-2019
|

Question

i want to parse pdf file such that i can identify each article's title and its description.so that i can store title text and descriptiontext in cache for search purpose.is there any library or tool for this?

Solution

First you can install Adobe Pdf IFilter dll. Then you can read properties from any .pdf file using this library (Using IFilter in C#). Or alternatively you can also look into this library (Implementing a TextReader to extract various files contents using IFilter).

Licensed under: CC-BY-SA with attribution

Not affiliated with StackOverflow