Question

Is there a Java library that can convert Microsoft Word files (.doc/.docx) to OpenDocument Text (.odt) format? A free library would be preferable.

Solution

I don't know of any library that does it directly, but it should be relatively easy to extract the bits you're interested in from a .docx using Apache POI:

http://poi.apache.org/

and then write them out in ODT format using ODFDOM:

http://incubator.apache.org/odftoolkit/odfdom/index.html

This should be relatively straightforward for simple documents, but if your use case calls for complex documents containing pictures etc., it might become a LOT harder.
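To give an idea of what that could look like, here is a minimal sketch that copies only the plain paragraph text from a .docx into a new .odt. It assumes Apache POI (poi-ooxml) and ODFDOM are on the classpath; the class name `DocxToOdtText` and the method `convert` are made up for illustration. Styles, tables, headers, and images are all dropped, and legacy .doc files are not handled (those go through POI's HWPF classes instead).

```java
import java.io.FileInputStream;
import java.io.InputStream;

import org.apache.poi.xwpf.usermodel.XWPFDocument;
import org.apache.poi.xwpf.usermodel.XWPFParagraph;
import org.odftoolkit.odfdom.doc.OdfTextDocument;

// Hypothetical helper: the class and method names are illustrative only.
class DocxToOdtText {

    // Copies paragraph text only; formatting, tables, and images are lost.
    static void convert(String docxPath, String odtPath) throws Exception {
        try (InputStream in = new FileInputStream(docxPath);
             XWPFDocument docx = new XWPFDocument(in)) {

            // Start from ODFDOM's empty text document template.
            OdfTextDocument odt = OdfTextDocument.newTextDocument();

            // Body paragraphs in document order (text inside tables is not included).
            for (XWPFParagraph paragraph : docx.getParagraphs()) {
                odt.newParagraph(paragraph.getText());
            }

            odt.save(odtPath);
        }
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("usage: DocxToOdtText <input.docx> <output.odt>");
            return;
        }
        convert(args[0], args[1]);
    }
}
```

Anything beyond plain text (runs with bold/italic, lists, embedded pictures) would need to be mapped by hand from POI's model to ODFDOM's, which is where the effort grows quickly.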

Anyway, hope this helps at least some ;)

OTHER TIPS

I believe everything you need is in this post: http://angelozerr.wordpress.com/2012/12/06/how-to-convert-docxodt-to-pdfhtml-with-java/

For instance:

JODConverter : JODConverter automates conversions between office document formats using OpenOffice.org or LibreOffice. Supported formats include OpenDocument, PDF, RTF, HTML, Word, Excel, PowerPoint, and Flash. It can be used as a Java library, a command line tool, or a web application.
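If adding a library dependency is not an option, the same LibreOffice engine can also be driven from Java by launching it in headless mode. Note that this is not JODConverter's API, just a plain-JDK sketch of the underlying idea. It assumes a LibreOffice install whose `soffice` executable is on the PATH; the class name `HeadlessOfficeConvert` and its methods are hypothetical.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: shells out to LibreOffice instead of using JODConverter.
class HeadlessOfficeConvert {

    // Builds the command line; assumes "soffice" can be resolved via the PATH.
    static List<String> buildCommand(Path input, Path outputDir) {
        return Arrays.asList(
                "soffice",
                "--headless",
                "--convert-to", "odt",
                "--outdir", outputDir.toString(),
                input.toString());
    }

    // Runs the conversion, waits for it to finish, and returns the exit code.
    static int convert(Path input, Path outputDir) throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(input, outputDir))
                .inheritIO()
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 2) {
            System.err.println("usage: HeadlessOfficeConvert <input.doc|docx> <output-dir>");
            return;
        }
        int exit = convert(Paths.get(args[0]), Paths.get(args[1]));
        System.out.println("soffice exited with code " + exit);
    }
}
```

Starting a fresh office process per file is slow, so for many conversions a library like JODConverter, which keeps an office instance running and reuses it, is likely the better fit.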

Licensed under: CC-BY-SA with attribution