Hi SuperPoweAutomators,
I am currently working on an invoice processing project using AI Builder for my organization. The extraction is supposed to happen as soon as the invoices of .pdf file-types are received via Outlook. I have 5 collections of invoices that I have to train my model on. The objective is to tag specific data from these invoices for extraction and map them into a defined Excel table.
My challenge is:
Some invoice collections have multiple records on one page and each record needs to be mapped to the target Excel sheet as a single record I am having a hard time figuring out how to train my model since only some of these documents come with multi-customer records on a single page.
In the screenshot I have provided here, I have a single-page invoice that has got 3 different customer records (three invoices merged into one invoice per se) and it could even be more (some can be a single page with a single record) which is easier to tag and extract. The red lines in the screenshot indicate the end and beginning of the records. How do I approach this problem and get a solution that works?
The required data on this invoice is:
IN,
OUT,
Nights,
GuestName,
RatePerNight(this will be multiplied by the number of Nights to evaluate the AccommodationRate), and
TotalMeals+OtherCosts
If there is anything that you need clarity on, kindly ask me, and thanks for your assistance in advance.