Finder - DOCman PDFs

Overview

This plugin is responsible for indexing the content in DOCman.  It performs the following operations:

  • Indexes the document title and description.
  • Indexes the category title against the document.
  • Indexes the raw text of associated PDF documents (limited to the first 150k of raw text).

Please note that in order to index PDF files, the server must have the pdftotext binary available.  For more information see the XPDF web site.  Also avoid using file names containing spaces as this may cause pdftotext to fail.

Parameters

This plugin has the following parameters.

  • Path to pdftotext - This is the directory in which pdftotext resides on the server (do not include the binary file or a trailing slash).  Possible values are:
    • /usr/local/bin (*nix)
    • /usr/bin (*nix)
  • Ignore Errors - Optionally ignore the following indexing errors (recommended):
    • The temporary text file could not be created by pdftotext (possibly a permission problem).
    • The PDF source file was not found.
  • Syncronize State - Syncronize the publishing state between indexed links and documents but only when the document changes (recommended).  This means if you unpublish a document, Finder will also unpublish the link and visa-versa.

Additional Comments

The Finder System Plugin includes the appropriate triggers for reindexing DOCman documents when they are saved.