PDF Search and Extract
Extract pages from PDF's
PDF Search and Extract (NEW August ‘09) is meant to extract pages from a PDF. It offers these features:
- Search for a specified string and extract pages that contain that string
- Enter regular expresions (RegEx) for advanced text searching
- Extract pages by page numbers (various options: pagenumber, even, odd, range)
- High speed text search. It takes less than a minute to search and extract pages from the Adobe PDF Reference (~ 1000 pages)
To "install": unzip the files from the downloaded ZIP-archive; place them in a folder and create a shortcut to the PdfSearchExtract executable. This tool uses the iTextSharp library (this library is integrated in the tool). The program also uses "pdftotext" from the Xpdf toolbox (www.foolabs.com) to extract plain text from the input PDF; the needed file is included in the ZIP-archive, but please check the foolabs site for the latest updates. Note that .NET 2.0 must be available on the PC. Download


