ocrmypdf
  • Introduction
  • Release notes
  • Installing additional language packs

Usage

  • Cookbook
  • Advanced features
  • Batch processing
  • PDF security issues
  • Common error messages
ocrmypdf
  • Docs »
  • OCRmyPDF documentation
  • View page source

OCRmyPDF documentation¶

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched.

PDFs are the best format for scanned documents. Unfortunately, PDFs can be difficult to work with. OCRmyPDF makes it easy to apply image processing and OCR to existing PDFs.

  • Introduction
  • Release notes
  • Installing additional language packs

Usage

  • Cookbook
    • Basic examples
    • OCR images, not PDFs
    • Image processing
    • Improving OCR quality
  • Advanced features
    • Control of OCR options
    • Changing the PDF renderer
  • Batch processing
    • Batch jobs
    • Directory trees
    • Hot (watched) folders
  • PDF security issues
    • PDFs may contain malware
    • How OCRmyPDF processes PDFs
    • Using OCRmyPDF online
    • Password protection, digital signatures and certification
  • Common error messages
    • Page already has text
    • Input file ‘filename’ is not a valid PDF

Indices and tables¶

  • Index
  • Module Index
  • Search Page
Next

© Copyright 2017, James R. Barlow.

Built with Sphinx using a theme provided by Read the Docs.