pdfgrep is a commandline tool to search through PDF files. It's similar in its application to grep, and also allows searching with regular expressions or case-insensitively, and recursively through directories. It prints out filenames, page numbers, number of occurences, and most importantly a colorized match list.
Homepage
Download
Recent Releases
2.2.019 Apr 2024 21:14
major bugfix:
## Features
- `--page-number` can now take an optional argument `label` that shows the
PDF's page label instead of the page index.
## Changes
- Port to PCRE2
- Change default value of `--include` to `*. Pp Dd Ff ` to allow for different case variations of `*.pdf`.
- Build system: Switch to C++14
- Build system: Require poppler-cpp = 0.36.0, to avoid a nasty bug in earlier versions that makes pdfgrep produce wrong results. See https://bugs.freedesktop.org/show_bug.cgi?id=91644
- Manpage: Specify SYNOPSIS more precisely.
- Ignore extra whitespace at the end of pages as generated by newer poppler. This fixes e.g. `--warn-empty` and our testsuite.
## Fixes
- Fix crash with invalid locale setting
- Fix `--warn-empty` with `--cache`
1.3.110 Aug 2014 23:44
minor feature:
There's an incompatible change in this release in that the -r option no longer follows symlinks. Newly introduced is a zsh completion module, support for password-protected PDFs, and that '.' can be omitted from recursive directory searches. With the added -p or --page-count matches are counted per page, and -m or --max-count introduces limits per file.