Hello, I am trying to create a workspace for doing a batch identification of information contained in tables within some pdf files, the thing is that the pdf also contains some other information as text, so I am a little bit confused on how to achieve that, and also if it will work with different tables, because the dataset I have was made by multiple authors so they are different tables on each pdf. I am wondering if it would be better to use a external application or a PythonCaller to achieve that, but before trying that out I would like to confirm first. I am attaching a sample pdf of my dataset so you can see the table that is in it. Thank you!
Best answer by geomancer
View original