Hi, I have a large PDF file which contains different types of tables. Each type of table is in the PDF multiple times. I would like to extract only one type of table, all occurences and write to an Excel sheet.
For example, I have a table of outstanding invoices for customers. Each table contains only the invoices for one customer. We have 30 customers, so there are 30 of these tables (table 1) in the PDF.
Within the same PDF there are tables for invoices we owe to our vendors. We have 5 vendors, so there are 5 of these tables (table 2) in the PDF.
I would like to extract only table 2 information. Above each of these tables is text stating "Invoice info for Vendor xxx" where xxx is a vendor number. I can Parse Text to find "Invoice info for Vendor", then retrieve the vendor number with Get Subtext. How do I then extract the table underneath the text? I have tried Extract Tables from PDF, but when I parse the text, I do not know what the index of the associated table is.