WebEquipe PDF Search

Description

WebEquipe PDF Search indexes your PDF files and makes their text searchable. When visitors search your site, they see results from both your posts/pages and the content inside your PDFs.

Video

Watch the setup and usage guide: https://www.youtube.com/watch?v=YKdGUjkK4bA

Supported PDFs

  • Works with: Standard, text-based PDFs (the kind you create or export from Word, Google Docs, etc.). File size default 50MB, configurable up to 500MB in settings.
  • Does not work with: Scanned PDFs or image-only PDFs (no extractable text)—use OCR elsewhere first. Password-protected PDFs cannot be indexed.

Keep Private PDFs Out of Search

Need to hide or protect certain PDFs? Use Exclude so a PDF is never indexed and never appears in search—even when you run “Re-index All PDFs” or bulk index. Excluded PDFs stay in your Media Library; they just won’t be searchable. Use Include later to allow indexing again. You can exclude or include PDFs from the Media Library or from Settings PDF Search (Manage PDFs).

How to Use

  1. Install and activate the plugin.
  2. Go to Settings PDF Search.
  3. Click Re-index All PDFs to index existing PDFs (new uploads are indexed automatically by default).
  4. Use your site’s search or add the shortcode [webequipe_pdf_search_form] on a page—PDFs will appear in results.

Existing PDFs need to be indexed once; after that, new uploads can be indexed automatically (optional in settings). To hide specific PDFs from search, use Exclude on them (Media Library or Settings PDF Search Manage PDFs).

Settings at a Glance

All options are under Settings PDF Search:

  • General – Turn indexing on/off, include PDFs in WordPress search, maximum file size (50MB default), excerpt length.
  • Indexing options – Batch size for re-indexing.
  • Search display options – Show or hide PDF icon, file size, page count, last updated date, and thumbnail preview in search results.
  • Advanced – Debug logging, memory/timeout for large PDFs, background processing, delete data on uninstall.

Full details and shortcode options: Settings PDF Search Help.

What You Can Do

  • Full-text search – Search inside PDF content, not just filenames.
  • Control each PDF – Index, unindex, or exclude from the Media Library or the PDF list on the settings page.
  • Bulk actions – Index, unindex, include, or exclude multiple PDFs at once.
  • Search display – Show PDF icon, file size, page count, and excerpts (configurable in settings).
  • Shortcode – Add a PDF search form with [webequipe_pdf_search_form] (see Settings PDF Search Help for options).
  • Background processing – Large PDFs are processed in the background to avoid timeouts.

Troubleshooting

PDFs not appearing in search

Ensure PDFs are indexed (Media Library “Search Indexed” column), “Enable Search Integration” is on in Settings PDF Search, and the PDF is not excluded. See the FAQ for more.

Indexing fails or times out

Use Settings PDF Search Advanced: enable “Background Processing” for large PDFs. Ensure your server PHP memory_limit and max_execution_time are sufficient for very large files (see Help for details). Reduce “Batch Size” if re-indexing many PDFs at once.

Other issues

See the FAQ above and Settings PDF Search Help for full documentation.

Privacy

The plugin stores extracted PDF text and metadata in a custom database table and a compressed backup in WordPress post meta for PDF attachments. If debug logging is enabled, it stores recent log entries in a WordPress option. It does not collect or send visitor data. If your PDFs contain personal or sensitive information, that content is in the index; mention this in your privacy policy if required.

Third-Party Libraries

  • smalot/pdfparser (LGPL-3.0) – PDF text extraction
  • symfony/polyfill-mbstring (MIT) – multibyte string support

Credits

Developed by WebEquipe. Uses smalot/pdfparser for PDF text extraction.

Support

  • Support: https://wordpress.org/support/plugin/webequipe-pdf-search

Screenshots

  • Settings page – indexing stats, options, shortcode, and PDF list
  • Manage PDFs – list, search, filters, pagination
  • Media Library – “Search Indexed” column (indexed, not indexed, excluded)
  • Single PDF – Index, Unindex, Exclude actions
  • Bulk actions – Index, Unindex, Include, Exclude
  • Search results – PDFs with icon, size, page count, excerpts
  • Shortcode search form on a page

Installation

From WordPress Admin

  1. Go to Plugins Add New.
  2. Search for “WebEquipe PDF Search”, install, and activate.

Manual Install

  1. Download the plugin zip.
  2. Go to Plugins Add New Upload Plugin, upload the zip, then install and activate.

After Activation

  1. Go to Settings PDF Search.
  2. Click Re-index All PDFs to index existing PDFs.
  3. New uploads are indexed automatically by default. Use Settings PDF Search Help for full documentation.

FAQ

What kind of PDFs are supported?

Standard, text-based PDFs (e.g. exported from Word or Google Docs). Default max size 50MB (up to 500MB in settings). Scanned or image-only PDFs have no extractable text—use OCR first. Password-protected PDFs cannot be indexed.

Why don’t my PDFs appear in search?

  1. Ensure they are indexed: in Media Library, check the “Search Indexed” column (green check = indexed).
  2. If not indexed, use the “Index” action on the PDF or bulk “Index PDFs”.
  3. Ensure Enable Search Integration is on in Settings PDF Search for normal site search. The shortcode works even when this is off.

How do I hide or protect private PDFs from search?

Use Exclude on the PDF (Media Library or Settings PDF Search). Excluded PDFs are never indexed and never appear in search, even after “Re-index All PDFs”. Use Include to allow indexing again, then Index to index them.

What’s the difference between Unindex, Exclude, and Include?

  • Unindex – Removes the PDF from search for now. You can index it again anytime (e.g. “Index” or “Re-index All PDFs”).
  • Exclude – Keeps the PDF out of indexing permanently until you clear it. “Re-index All PDFs” and bulk “Index PDFs” skip excluded PDFs. Use for private or sensitive files.
  • Include – Clears the exclude flag so the PDF can be indexed again. You still need to run “Index” or “Index PDFs” after including.

How do I index or re-index many PDFs at once?

Media Library: Select the PDFs Bulk Actions “Index PDFs” (or “Unindex”/”Exclude”/”Include”) Apply.

Settings page: Go to Settings PDF Search Manage PDFs Scan PDFs select PDFs choose bulk action Apply.

Does it work with scanned PDFs?

No. The plugin extracts text from PDFs. Scanned PDFs are images; you need OCR elsewhere first, then use the resulting text-based PDF.

What’s the maximum PDF size?

Default is 50MB. You can raise it (up to 500MB) in Settings PDF Search Maximum File Size.

Will it slow down my site?

No. Indexing runs in the background and search uses the index. Large PDFs are processed asynchronously.

Password-protected PDFs?

They cannot be indexed because the plugin cannot read their content without the password.

Multisite?

Yes. Each site has its own index.

Reviews

Read all 1 review

Contributors & Developers

“WebEquipe PDF Search” is open source software. The following people have contributed to this plugin.

Contributors

Translate “WebEquipe PDF Search” into your language.

Interested in development?

Browse the code, check out the SVN repository, or subscribe to the development log by RSS.

Changelog

1.0.1

  • Block theme and theme compatibility: PDF meta shows in block themes (e.g. Twenty Twenty-Four/Five) and themes without excerpt block; no duplicate preview or double meta (Astra/Elementor).
  • Theme-agnostic CSS: only webequipe-pdf-* classes; improved preview/meta sizing and alignment.
  • “Show Author” setting to show uploader name in result meta; Avada compatibility for PDF excerpts.
  • Help page and PHPCS/compliance updates.

1.0.0

  • Initial release
  • Automatic PDF indexing on upload (optional)
  • Full-text search in WordPress search and via shortcode
  • Settings page: indexing, display options, shortcode, PDF list
  • Media Library: index status and per-PDF actions (Index, Unindex, Exclude)
  • Bulk actions: Index, Unindex, Include, Exclude
  • Exclusion system to keep private or sensitive PDFs out of search
  • Background processing for large PDFs
  • Template tags and Help documentation
  • WordPress Multisite support