New Python OCR Libraries 2026
last commit 1 day ago paddlepaddle/paddleocr 73K +849
added 1 year ago
Awesome multilingual OCR toolkit based on PaddlePaddle. It's an ultra-lightweight OCR system with support for 80+ languages, data annotation, and synthesis.
last commit 1 week ago sirfz/tesserocr 2K +2
added 1 year ago
A Python wrapper for the tesseract-ocr API
last commit 1 year ago madmaze/pytesseract 6K +3
added 1 year ago
A Python wrapper for Google Tesseract
last commit 1 year ago lukas-blecher/latex-ocr 16K +29
added 1 year ago
Takes an image of a math formula and returns corresponding LaTeX code.
last commit 5 days ago ocrmypdf/ocrmypdf 33K +116
added 1 year ago
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted.
last commit 9 months ago hiroi-sora/umi-ocr 36K +207
added 1 year ago
Free, open source, batch offline OCR text recognition tool.
last commit 3 months ago jaidedai/easyocr 29K +78
added 1 year ago