parsing pdf content stream to understand paragraph boundary

https://stackoverflow.com/questions/14903240

pdf
pdfbox
xpdf

09-03-2022
|

문제

Is there a way to parse the pdf content stream and identify paragraph boundary? I read ISO 32000-1:2008 but could not understand if, the pdf content stream contains any operator which tells a display software to start the paragraph, or end it. Can any text extractor software like pdfbox or xpdf provide that information?

올바른 솔루션이 없습니다

라이센스 : CC-BY-SA ~와 함께 속성

제휴하지 않습니다 StackOverflow