Hi guys,
I have a question regarding on how to convert pdf files to excel file or csv file. Here is an example of what one of my pdfs look like:
As seen from above image, there is a table. I had used the action "Extract tables from pdf", but it results in gibberish data. So I wanted to know if there was a way to get the information of which I barred out from this image. Like PO #, PO Date and even the whole table.
I know that there is AI Builder to solve this, but that uses Premium. I wanted to utilize the full capacity of non-premium features of PAD.
Tags but open to anyone's suggestions: @Deenuji , @Agnius , @UshaJyothi20 , @Nived_Nambiar , @WillSG , @eetuRobo
As @Deenuji said, we need to see how each pdf file looks like , like format of invoices are same or not.
If format is same, then we can try with option for regex or string manipulation to capture the required details from pdf file.
Nived N 🚀
LinkedIn: Nived N's LinkedIn
YouTube: Nived N's YouTube Channel
🔍 Found my answer helpful? Please consider marking it as the solution!
Your appreciation keeps me motivated. Thank you! 🙌
If we opt for regex, I anticipate it might be a challenging process, especially if your PDFs have varying formats.
I would like to provide more informed advice after analyzing two or three PDF files. If it's not sensitive information, could you please share them privately or attach them here for review?
You can use read pdf text with OCR and you can get the whole data
And also you can use regex and parse text activity and get all the required values....
If all the PDFs are same format then please check the table data output try to sort it out how you are getting the output and what you need.....
If you want any further clarification please share sample pdf so that I can give a try...
Hope this helps
Usha
WarrenBelz
146,776
Most Valuable Professional
RandyHayes
76,287
Super User 2024 Season 1
Pstork1
66,093
Most Valuable Professional