Fix/171 #173

jazzido · 2017-07-28T16:16:04Z

The biggest change is moving text extraction out of ObjectExtractorStreamEngine. Before this change, that class contained code pasted from PDFBox's LegacyPDFStreamEngine.

We now use PDFTextStripper (which extends LegacyPDFStreamEngine) in ObjectExtractor.
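The idea of reusing a library's text engine through subclassing, rather than copying its code, can be sketched roughly as follows. This is a simplified stand-in and not the code in this PR: `BaseTextEngine`, `CollectingStripper`, `onGlyph`, and `StripperDemo` are hypothetical names that only play the roles of the PDFBox engine and the new stripper class, and the "page" here is a plain string rather than a PDF content stream.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a library-owned engine (the role PDFBox's
// text engine plays). The parsing loop lives here and is never copied.
abstract class BaseTextEngine {
    // Walks the input and reports each non-whitespace character to the hook.
    final void process(String pageText) {
        for (int i = 0; i < pageText.length(); i++) {
            char c = pageText.charAt(i);
            if (!Character.isWhitespace(c)) {
                onGlyph(c, i);
            }
        }
    }

    // Extension point that subclasses override.
    protected abstract void onGlyph(char glyph, int offset);
}

// Hypothetical subclass: inherits the parsing loop and only collects
// the pieces the caller cares about.
class CollectingStripper extends BaseTextEngine {
    private final List<String> glyphs = new ArrayList<>();

    @Override
    protected void onGlyph(char glyph, int offset) {
        glyphs.add(glyph + "@" + offset);
    }

    List<String> glyphs() {
        return glyphs;
    }
}

class StripperDemo {
    public static void main(String[] args) {
        CollectingStripper stripper = new CollectingStripper();
        stripper.process("ab c");
        System.out.println(stripper.glyphs());
    }
}
```

With this shape, fixes to the base engine reach the subclass through a dependency upgrade instead of a manual re-paste, which is the maintenance benefit the change is after.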


* Started work on tabulapdf#171
* using PDFTextStripper instead of duplicating pdfbox's code in ObjectExtractorStreamEngine
* moved textstripper to its own file
* removed useless fields/methods in ObjectExtractorStreamEngine
* adjust test expectation

jazzido added 5 commits July 27, 2017 21:49

Started work on #171

50ca90e

using PDFTextStripper instead of duplicating pdfbox's code in ObjectE…

acfc2ef

…xtractorStreamEngine

moved textstripper to its own file

cb084bc

removed useless fields/methods in ObjectExtractorStreamEngine

7a39372

adjust test expectation

a28870d

jazzido merged commit ec02165 into master Jul 28, 2017

jazzido deleted the fix/171 branch July 28, 2017 16:18

jazzido mentioned this pull request Jul 28, 2017

upgrading to tabula-java-1.0.0 tabulapdf/tabula#707

Merged
