[Monster PR] Upgrade to PDFBox 2.0 by jazzido · Pull Request #150 · tabulapdf/tabula-java

and others added 30 commits

December 3, 2015 17:01
org.apache.pdfbox.examples.util.RemoveAllText
… string

Add a test writer two tables for CSV output
# Conflicts:
#	src/test/resources/technology/tabula/json/schools.json
#	src/test/resources/technology/tabula/json/spanning_cells.json
#	src/test/resources/technology/tabula/json/spanning_cells_basic.json
#	src/test/resources/technology/tabula/json/twotables.json

@jazzido

@jazzido

EmpowerZ pushed a commit to EmpowerZ/tabula-java that referenced this pull request

Oct 23, 2020
* Starting with upgrade to PDFBox 2.0 (tabulapdf#52)

* 2.0

* little progress in upgrading to pdfbox 2

* upgrade to pdfbox 2 starting to show signs of life

* Fix TextElement creation

* fix tabs

* Use the code from LegacyPDFStreamEngine to create the TextElements

* Fix removeText function using the example:

org.apache.pdfbox.examples.util.RemoveAllText

* close the document

* close removed text document

* fix array serialization

* add spanning cells test with CSV format

* - Remove capheight calculation
- Temporally set height

* Test writer two tables checking the json result object instead of the string

Add a test writer two tables for CSV output

* Fix pageTransform when there is a rotation
Add more csv tests

* fix path iterator

* update json tests

* update json outputs

* upgrade pdfbox version

* back to the old implementation and catch the IndexOutOfBoundsException

* Remove hardcoded code

* Remove more hardcoded code

* test all the elements of the detected table

* Change the expected table top value

* Increase the threshold factor to support a greater headings

* Fix rectangle comparator.

* fix wrong expected column size, 5 instead of 6.

add more tests

* update expected table, more spaces are expected to respect the alingment.

* when the text value has length > 1, clean the spaces.

* clean code

* remove stackstrace

* add log error

* upgrade all dependencies

* code formatting

* setting pom to snapshot version