GitHub - MerlinB/pycerpt

pycerpt is a command-line utility for extracting highlighted text from PDFs.

Quickstart

Get the latest version with pip install pycerpt.

pycerpt outputs Markdown by default. Run excerpt test.pdf to print the highlights to standard output, or save them to a file with excerpt test.pdf > out.md or excerpt test.pdf out.md.
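
If you want to call the tool from a script, the invocations above could be wrapped roughly as follows. This is a minimal sketch, not part of pycerpt: it only assumes that the excerpt command is on your PATH and accepts the arguments shown in this README. The helper names build_excerpt_command and extract_to_markdown are hypothetical.

```python
import subprocess
from typing import List, Optional


def build_excerpt_command(pdf_path: str, out_path: Optional[str] = None) -> List[str]:
    """Build the argument list for the `excerpt` CLI (hypothetical helper).

    Mirrors the two forms from the README:
      excerpt test.pdf
      excerpt test.pdf out.md
    """
    cmd = ["excerpt", pdf_path]
    if out_path is not None:
        cmd.append(out_path)
    return cmd


def extract_to_markdown(pdf_path: str) -> str:
    """Run `excerpt <pdf>` and return whatever it writes to stdout.

    Assumes pycerpt is installed; raises CalledProcessError if the
    command exits with a non-zero status.
    """
    result = subprocess.run(
        build_excerpt_command(pdf_path),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Calling extract_to_markdown("test.pdf") would then be the scripted equivalent of running excerpt test.pdf and reading its output.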

Generating PDFs

PDF generation requires additional dependencies: pip install pycerpt[pdf].
Usage: excerpt test.pdf out.pdf.
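
When exporting highlights from several PDFs, it helps to derive an output name that does not overwrite the source file. The sketch below only builds the command line and assumes, based on the examples in this README, that a .pdf output path is what triggers PDF generation. The helper name pdf_export_command and the "-highlights" suffix are made up for illustration.

```python
from pathlib import Path
from typing import List


def pdf_export_command(pdf_path: str, suffix: str = "-highlights") -> List[str]:
    """Build an `excerpt <in.pdf> <out.pdf>` command (hypothetical helper).

    The output file is placed next to the input and named
    <stem><suffix>.pdf so the source PDF is never overwritten.
    """
    src = Path(pdf_path)
    dest = src.with_name(f"{src.stem}{suffix}.pdf")
    return ["excerpt", str(src), str(dest)]
```

The resulting list can be passed to subprocess.run(..., check=True) once pycerpt[pdf] is installed.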