I have a desktop flow that is processing a batch of invoices in a folder into excel
I use "Extract text from PDF" to pull the contents into a text file then parse that row by row into a table and into excel.
Functionally it works great, but some of the PDF files fail to extract the text from the PDF. sometimes it's because there is a blank page at the end of the PDF, but sometimes there's no obvious reason. All these invoices are coming from the one accounts package and are the same format so I don't know why it's failing.
Anyway I could live with the failures, I append _failed onto the end of the filename and they can be processed manually.
My problem is that once one PDF fails, they will all fail after that. it's like the "Extract text from PDF" function crashes and then stays crashed. Anyone have a suggestion of how to restart that function? or reccomend a command line tool that works as well as it?