Question

The most recent version of ScraperWiki depends on Poppler (or so the GitHub says). Unfortunately, it only specifies how to get it on macOS and Linux, not Windows.

A quick googling turned up nothing too promising. Does anyone know how to get Poppler on Windows for ScraperWiki?

Was it helpful?

Solution 2

Poppler Windows binaries are available from ftp://ftp.gnome.org/Public/GNOME/binaries/win32/dependencies/ -- but note that those aren't quite up-to-date.

If you're looking for Python (2.7) bindings (as this question's tag suggests), I requested them in the past via this bug report. A couple of people apparently managed to produce something, but I haven't checked those out yet.

As for a more recent (python bindings unrelated) poppler Windows binaries Google result, see http://blog.alivate.com.au/poppler-windows/

Finally, there's the brand-new (and currently very frequently updated) PyGObject all-in-one installer (mainly aiming to provide PyGObject-instrospected Gtk+3 Python bindings etc. for Windows), so if that's what you're looking for, go to http://sourceforge.net/projects/pygobjectwin32/files/?source=navbar

OTHER TIPS

Other answers have linked to the correct download page for Windows users but do not specify how to install them for the uninitiated.

  1. Go to this page and download the binary of your choice. In this example we will download and use poppler-0.68.0_x86.

  2. Extract the archive file poppler-0.68.0_x86.7z into C:\Program Files. Thus, the directory structure should look something like this:

C:
    └ Program Files
        └ poppler-0.68.0_x86
            └ bin
            └ include
            └ lib
            └ share
  1. Add C:\Program Files\poppler-0.68.0_x86\bin to your system PATH by doing the following: Click on the Windows start button, search for Edit the system environment variables, click on Environment Variables..., under System variables, look for and double-click on PATH, click on New, then add C:\Users\Program Files\poppler-0.68.0_x86\bin, click OK.

  2. If you are using a terminal to execute poppler (e.g. running pdf2image in command line), you may need to reopen your terminal for poppler to work.

  3. Done!

Download Poppler Packaged for Windows

https://github.com/oschwartz10612/poppler-windows/releases

I threw together a quick repo with the latest Poppler prebuilt-binaries packaged with dependencies for Windows. Built with the help of conda-forge and poppler-feedstock. Includes the latest poppler-data.

With anaconda installed on windows one can simply execute:

conda install -c conda-forge poppler

UPDATE 2

See the answer by Owen Schwartz.


UPDATE 1

Rumpel Stielzchen's comment:

This site is no longer maintained. Poppler version 0.68 is very outdated today. You find the latest version compiled also for Windows here: https://anaconda.org/conda-forge/poppler/files Sadly there is no 32 bit version, only 64 bit

… but this package contains no dependencies:

It seems that the Anaconda people have a tool to download a package and all dependencies. And there is a file in the TAR package: index.json which lists the package on which it depends. I downloaded the dependencies one by one, and yes: It WAS a pain.


Original answer

Latest Poppler Windows binaries can be found here:

http://blog.alivate.com.au/poppler-windows/

You should consider using Windows Subsystem for Linux (WSL).

  1. Enable WSL on Windows 10 (it will not work on S edition)
  2. Install Ubuntu (latest version) on WSL from the Windows Store
  3. Open Ubuntu command-line

In the Ubuntu Command-line, run the following commands:

  • sudo apt-get update
  • sudo apt-get upgrade
  • sudo apt install poppler-utils
  • pdftocairo -v - to check the installed version

You can then run pdftocairo (for example) in two ways:

  1. Within the Ubuntu command-line: pdftocairo ...
  2. Directly from Windows command-line: wsl pdftocairo...

NOTE: There is a default version of poppler for each release of Ubuntu. You will need to look up the instructions (there should be plenty on the internet), for how to install the latest version of poppler-utils on Ubuntu. This might involve quite a few steps, which will compile from the source code. For example, something like this https://askubuntu.com/a/722955. And then you might get a lot of problems.

The latest version of Ubuntu 19.04, can install Poppler 74. But Ubuntu 18.04 seems to be the latest version you can install for WSL for now, and that installs Poppler 62.

Chocolatey

Poppler is available as Chocolatey package:

choco install poppler

By default Poppler is installed in C:\ProgramData\chocolatey\lib\poppler and shims are automatically created for the following tools: pdfdetach, pdffonts, pdfimages, pdfinfo, pdfseparate, pdftocairo, pdftohtml, pdftoppm, pdftops, pdftotext, pdfunite.

To update Poppler, run:

cup poppler

Scoop

Install from the main bucket:

scoop install poppler

By default Poppler is installed in ~\scoop\apps\poppler and shims are automatically created for the following tools: pdfdetach, pdffonts, pdfimages, pdfinfo, pdfseparate, pdftocairo, pdftohtml, pdftoppm, pdftops, pdftotext, pdfunite.

To update Poppler, run:

scoop update poppler

TeX Live

As mentioned in another answer, MiKTeX currently ships with Poppler tools, and so does another LaTeX distribution, TeX Live. From the guide:

Command-line tools.

A number of Windows ports of common Unix command-line programs are installed along with the usual TeX Live binaries. These include gzip, zip, unzip, and the utilities from the poppler suite (pdfinfo, pdffonts, …)

Poppler suite is located by default in C:\texlive\<year>\bin\win32 and, if you can compile your LaTeX documents, should work out of the box since this location is added to the PATH by the installer.

To Simply install Poppler on Windows run through the below mentioned steps without touching the environmental varible.

  1. Download the Latest Poppler Binary from the URL: http://blog.alivate.com.au/poppler-windows/index.html
  2. Unzip it and copy the poppler-0.68.0_x86 folder in some path for ex, C:/User/Poppler/poppler-0.68.0_x86/poppler-0.68.0/bin

Now go to your Python code where you want to call Poppler for image conversion and use the below mentioned code snippet:

from pdf2image import convert_from_path
pages = convert_from_path('MyPdf.pdf', 500, poppler_path = r'C:\User\Poppler\poppler-0.68.0_x86\poppler-0.68.0\bin')
for page in pages:
    page.save('out.jpg', 'JPEG')

It looks like a version that is build-able with visual studio can be found here https://bitbucket.org/merarischroeder/poppler-for-windows/overview

Up to date binaries for Windows x64, Mac OSX-64, Linux-64bit can be found here

https://anaconda.org/conda-forge/poppler/files

Poppler version 0.84 is available at the link as of this writing which is very current. The accepted answer and the link given by Alexey are no longer pointing to current versions of poppler

Update : As of March 8, 2021 the best answer is by Owen Schwarz above https://stackoverflow.com/a/62615998/590388

Another option is that if you have installed MikTeX then poppler is included by default and is probably already in your PATH. In my case the binaries were installed under: C:\Program Files\MiKTeX 2.9\miktex\bin\x64

MSYS2 has the latest version available for install.

If you don't want to install the whole enviroment (or you wanted some kind of portable version) you could also just download Poppler straight from the repository, but then you'd also have to manually handle dependencies. Namely: libwinpthread, nspr, gcc-libs, nss, curl, brotli, openssl, libidn2, libiconv, gettext, libunistring, nghttp2, libpsl, libjpeg-turbo, lcms2, openjpeg2, libpng, zlib, libtiff, xz and zstd.

  1. Install the Microsoft Visual C++ Build Tools

  2. Install poppler through the Conda prompt conda:

conda install -c conda-forge poppler

please note: if you don't have anaconda installed, it can be downloaded from here, https://docs.anaconda.com/anaconda/install/windows/

Installing Poppler on Windows

  1. Go to https://github.com/oschwartz10612/poppler-windows/releases/
  2. Under Release 21.11.0-0 Latest v21.11.0-0
  3. Go to Assets 3 Download
  4. Release-21.11.0-0.zip

Adding Poppler to path

  1. Add Poppler installed to loaction : C:\Users\UserName\Downloads\Release-21.11.0-0.zip
  2. Add C:\Users\UserName\Downloads\Release-21.11.0-0.zip to system variable path in Environment Variable

This is what I did.

  1. Install msys2

  2. Open msys2 shell and then run:

    pacman -Syu

    pacman -S mingw-w64-x86_64-poppler

  3. Open MingW64 Shell and access poppler binaries

The binaries are installed at: C:\msys64\mingw64\bin

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top