Hello,
I am using the Unstructured Documents Custom Model for some PDF contracts.
The original PDFs are structured like this:
| Proposal Number | 13450 | | | | | | | Order Date | 08/31/2023 |
| Buyer | Jane Doe | | | | | | | Version | 3 |
| Order Number | 580 | | | | | | | | |
| | | | | | | | | | |
| | | | | Week | | |
| Refererence | Product | Category | Rate | 1 | 2 | 3 | 4 | Total Cost | |
| 1 | Product A | COR | 1656.00 | 1 | 2 | | 5 | 13248.00 | |
| | 09333059 | | | | | | | | |
| 2 | Product B | RET | 50.00 | 3 | 5 | 9 | 3 | 1000.00 | |
| | 037384934 | | | | | | | | |
I have trained the model to recognize the header fields and the items table. The model successfully recognizes multipage table results like this:
| ref | prod | prod_code | cat | rate | wk_1 | wk_2 | wk_3 | wk_4 |
| 1 | Product A | 09333059 | COR | 1656.00 | 1 | 2 | | 5 |
| 2 | Product B | 037384934 | RET | 50.00 | 3 | 5 | 9 | 3 |
How to I use the model in a flow to export the header fields and table to a flat file csv like this?
| prop_no | buyer | order_no | order_date | version | ref | prod | prod_code | cat | rate | wk_1 | wk_2 | wk_3 | wk_4 |
| 13450 | Jane Doe | 580 | 08/31/2023 | 3 | 1 | Product A | 09333059 | COR | 1656.00 | 1 | 2 | | 5 |
| 13450 | Jane Doe | 580 | 08/31/2023 | 3 | 2 | Product B | 037384934 | RET | 50.00 | 3 | 5 | 9 | 3 |
Please note I am looking to export each document's data to separate csv files.
Many thanks in advance for your help!