I was wondering what everyone's thoughts might be on how to approach or solve this scenario.
I have an email inbox that receives PDF invoices, and I extract fields from each invoice document into a table called "Invoices". I would then like to match each invoice to a row in the purchase order table. If I cannot match exactly, I'd like to find the purchase order with the highest similarity score, ideally expressed as a percentage, based largely on three values being most similar. Two of the three fields are text fields (vendor name and description), and the third is a number value. (The percentage is a nice-to-have; I'll take whatever works, I'm desperate.)
Here's a picture of a fake mock-up if it helps
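
To make the scoring I'm after concrete, here is a rough Python sketch of one possible approach, not a tested solution. It assumes each row is a dict with hypothetical keys `vendor`, `description`, and `amount` (my real column names may differ), uses the standard library's `difflib.SequenceMatcher` for the two text fields, uses relative difference for the number, and combines them with made-up weights into a percentage.

```python
from difflib import SequenceMatcher


def text_similarity(a: str, b: str) -> float:
    """Return a 0..1 ratio for two strings, ignoring case and outer whitespace."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def number_similarity(a: float, b: float) -> float:
    """Return 1.0 for equal values, dropping toward 0.0 as the relative gap grows."""
    if a == b:
        return 1.0
    return max(0.0, 1.0 - abs(a - b) / max(abs(a), abs(b)))


def match_score(invoice: dict, po: dict, weights=(0.4, 0.3, 0.3)) -> float:
    """Weighted similarity of vendor, description, and amount, as a percentage.

    The weights are an assumption for illustration; tune them to your data.
    """
    w_vendor, w_desc, w_amount = weights
    score = (
        w_vendor * text_similarity(invoice["vendor"], po["vendor"])
        + w_desc * text_similarity(invoice["description"], po["description"])
        + w_amount * number_similarity(invoice["amount"], po["amount"])
    )
    return round(100 * score / sum(weights), 1)


def best_match(invoice: dict, purchase_orders: list):
    """Return (purchase_order, score) for the highest score, or None if no rows."""
    if not purchase_orders:
        return None
    scored = ((po, match_score(invoice, po)) for po in purchase_orders)
    return max(scored, key=lambda pair: pair[1])
```

Calling `best_match(invoice_row, po_rows)` would give back the closest purchase order together with its percentage, so an exact match scores 100 and anything below some cutoff could be flagged for manual review. The character-based ratio is only one choice; a token-based or fuzzy-matching library might handle reordered vendor names better.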

