Performant way to store interim PDF documents in document aggregation?
-
12-10-2019 - |
Question
I have to load a few hundred documents of various sorts from disk and produce one giant aggregated PDF.
My approach is to convert each document to PDF individually, then aggregate them.
I have been writing each PDF document to isolated storage as part of this process.
Is there are better way to do this?
Performance is a priority - this is client side.
Solution
Have you heard of Pdftk - http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/ (free) - I use it to merge multiple pdfs into one.
In combination with BullZip PDF printer (free) - http://www.bullzip.com/products/pdf/info.php - I can basically create any PDFs I need.
I'm not sure if you can do any automation with BullZip, but with Pdftk you could certainly work with it through C# and System.Diagnostics.Process.
Here's a typical command line that I use:
pdftk folder1/file1.pdf folder1/file2.pdf folder2/file1.pdf cat output all_files.pdf
That merges the 3 files, in order, to all_files.pdf