Hi All,
I have a bunch of PDF documents containing tabular data that I need to extract so I can process it and run calculations on it.
Is there anything out there (preferably free and open source) that can extract tabular data from PDFs in bulk into a more usable format? It could be built into an app, run from the command line, or be something I can call in a loop from code.
The output can be any format really, as long as the table structure is preserved.
Everything I've found so far either handles only one PDF at a time or doesn't preserve the table structure (it only extracts plain, unstructured text).
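To make the "looping in code" option concrete, here is a rough sketch of the workflow I'm after. It only shows the batch part: `batch_extract` and the `extract_tables` callback are names I made up for illustration, and `extract_tables` is a placeholder for whatever tool actually does the extraction (that is the piece I'm missing). I'm assuming it would take a PDF path and return each table as a list of rows.

```python
import csv
from pathlib import Path
from typing import Callable, Iterable, List

# Assumption: one table is a list of rows, each row a list of cell strings.
Table = List[List[str]]


def batch_extract(
    pdf_dir,
    out_dir,
    extract_tables: Callable[[Path], Iterable[Table]],
) -> List[Path]:
    """Run a (hypothetical) table extractor over every PDF in a folder.

    Each table found is written to its own CSV file so that the
    row/column structure survives. Returns the CSV paths written.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    # Note: the "*.pdf" pattern is case-sensitive on most non-Windows systems.
    for pdf in sorted(Path(pdf_dir).glob("*.pdf")):
        for i, table in enumerate(extract_tables(pdf), start=1):
            target = out / f"{pdf.stem}_table{i}.csv"
            # newline="" lets the csv module control line endings itself.
            with target.open("w", newline="", encoding="utf-8") as fh:
                csv.writer(fh).writerows(table)
            written.append(target)
    return written
```

CSV is just one choice here; any output that keeps rows and columns intact would work for me.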
If you have any ideas, please post.