PDF Liberation

Popular repositories Loading

  1. A place to collect and share knowledge about liberating data from PDFs

    Shell 55 7

  2. Forked from jsfenfen/whatwordwhere

    Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.

    Python 22 5

  3. Resources related to PDF Liberation hackathon

    12 10

  4. experimenting with pdf2text and python pdf-table-extract

    JavaScript 11 3

  5. This project will liberate data from pdf files found on http://www.cityofjerseycity.com/pub-info.aspx?id=2430 and will create .csv and .json files to be uploaded on https://data.openjerseycity.org/…

    Python 6 1

  6. (DC team) experimenting with available options for extracting info from PFDs

    Python 4 2