Hey everyone! First post here, just starting to play with Power Automate Desktop. In an effort to move off of a larger more expensive RPA platform, I am putting together some POC's and doing some feasibility planning with Power Automate to show that it will be sufficient to replace our existing platform. One of our larger existing solutions on our current platform involves parsing a lot of data from insurance applications that we receive as PDF files. I am running into an issue with the built in PDF - Extract text from PDF function in that it is returning the text in a different way than expected, to the point that we would not reasonably be able to consume it.
Here is a mocked up version of one page of an application...

When using the PDF - Extract text from PDF action in PAD, this is how the content is returned...

This makes it nearly impossible to confidently ascertain which data should belong to which fields.
If I open the PDF file in Adobe Reader DC and use the built in 'Export PDF' tool and export it to a .txt file, the content gets rendered like this...

The field values come over adjacent to the field names making it consumable. This also happens to be the way the content is returned in our existing platform with its built-in 'Extract Text From PDF' command.
Conclusion: I need to find an alternative method that will extract the text from these files in a format that will be consumable. Worst case scenario is I would just automate the interaction with Adobe Reader DC but I was hoping there might be a better alternative. Thanks!