vasgat/pdf-table-extractor

pdf-table-extractor

PDF Table Extractor is a tool that uses the tabulapdf library to extract tables from PDF files, applies some further processing, and stores the resulting tables as CSV files.

Note: An online version of the tool is also available, along with some documentation on how to use it.

You can run pdf-table-extractor from the command line as follows:

      java -jar .\pdf-table-extractor-1.0.jar ..\pdf_examples\COGSuppliers.pdf

      java -jar .\pdf-table-extractor-1.0.jar ..\pdf_examples\Benetton.pdf WITHOUT_RULINGS UPPER_MERGE
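To process several PDFs in one go, the command shown above can be wrapped in a small launcher. The following is only a sketch under stated assumptions: the class name `BatchExtract` and the helper `buildCommand` are hypothetical and not part of pdf-table-extractor, and any options (such as `WITHOUT_RULINGS` or `UPPER_MERGE`) are simply passed through to the jar unchanged. This README does not say where the CSV output is written or what the exit codes mean, so the sketch only reports the exit code of each run.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: runs the pdf-table-extractor jar once per PDF in a directory.
public class BatchExtract {

    // Builds the same command line as the examples above for a single PDF.
    // Options are appended as-is; this sketch does not validate them.
    static List<String> buildCommand(Path jar, Path pdf, String... options) {
        List<String> cmd = new ArrayList<>();
        cmd.add("java");
        cmd.add("-jar");
        cmd.add(jar.toString());
        cmd.add(pdf.toString());
        cmd.addAll(Arrays.asList(options));
        return cmd;
    }

    public static void main(String[] args) throws IOException, InterruptedException {
        if (args.length < 2) {
            System.err.println("usage: java BatchExtract <jar> <pdf-directory> [options...]");
            return;
        }
        Path jar = Paths.get(args[0]);
        Path dir = Paths.get(args[1]);
        String[] options = Arrays.copyOfRange(args, 2, args.length);

        // Visit every *.pdf file directly inside the given directory.
        try (DirectoryStream<Path> pdfs = Files.newDirectoryStream(dir, "*.pdf")) {
            for (Path pdf : pdfs) {
                int exit = new ProcessBuilder(buildCommand(jar, pdf, options))
                        .inheritIO()   // show the tool's own output in this console
                        .start()
                        .waitFor();
                System.out.println(pdf + " -> exit code " + exit);
            }
        }
    }
}
```

For example, `java BatchExtract.java .\pdf-table-extractor-1.0.jar ..\pdf_examples WITHOUT_RULINGS UPPER_MERGE` (Java 11 or later) would run the second command above against each PDF in `..\pdf_examples`.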
