Hi, I've tried training my model with 21 different documents. It seems to extract the table ok for the first couple of pages but once it gets to the last page (3 or 4) it always seems to skip some fields. I've tried tagging the tables in multiple different ways, using advanced tagging mode, normal tagging etc but to no avail.
I've attached an example of the output of two documents with the missing fields highlighted in yellow.
Does anyone have any ideas on how I can solve this issue; or an alternative software I can use for this task?
Thanks for the comment, sorry you're having the same issue. After trying using the unstructured document format it seems to be working well 😁. Hopefully this helps you too
Hi thanks for the reply, after lots of trial and error I'm happy to say the unstructured option works well with the only limitation that you can't have multiple formats (collections) within it.
Yeah same problem. This document processing model can be used only if You need to process one format and one page file, train with lots of examples and make many many steps later
Hi @KT_NZ ,
I would suggest that you add more samples in the training set with tables that span 3 to 4 pages.
You can also try with unstructured as type of documents your model will process
Hope it helps!
Thank you