Question

I'm looking for a plugin to extend the search into the content of attachments (typically text file like doc or Pdf).

Was it helpful?

Solution

This topic in the WordPress support forums addresses this question, and suggests Google custom search, which should in theory search PDFs and DOCs and would be easy to set up and try, or the Search Everything plugin, which might be what Jeff said doesn't work in his comment above.

There's also the Sphider Wordpress plugin, an search plugin that claims to support indexing PDF and DOC files. It might be more difficult to set up, but it's the best option I can find.

OTHER TIPS

I had this same need (indexing contents of uploaded files) and was amazed there was no plugin that allows this. The above do not work, and searching gave nothing.

So - I wrote one.

It works, we're using it at work. It's not super friendly - takes a little bit of expertise to ensure java & dependency (apache tika) are working - but it does the trick.

I may update it & submit to wordpress plugins - but meanwhile, if it helps anyone:

http://avatari.net/public/wordpress/masala/

Problem: Need to use wordpress search to find text within uploaded pdf files.

Solution:

Step 1. Upload your pdf(s)

Step 2. Under the description of each pdf paste the following:

Click the link above to view PDF

Step 3. Copy the text from the pdf and paste it in the above in place of PDFDOCUMENTTEXT

Step 4. Install Search Everything plugin and then activate. Your searches should now return results including a list of attachment pages that include the text within the pdfs but will direct the user to click on the pdf link.

If anyone else knows of a better way to do this please let me know.

I have found one solution. Maybe it will work for you also :

Masala Plugin : https://github.com/nanodust/masala , which actually add new post meta field while uploading any doc/pdf/ppt/txt file. It will put the file content into newly created meta field

This plugin uses : 1) Java 2) Apache Tika – really awesome 3) Search Everything plugin which is responsible for search in meta table.

Here, you need to configure : 1) Java path 2) Download latest apache tika jar and you can put into your project root folder and configure path in masala.php 4) Give the allowed file extensions.

ITS DONE !!

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top