Thanks for the details Eric. Given your setup cloud flow, SharePoint trigger, inconsistent headers this narrows it down to two realistic options.
AI Builder is out because the inconsistent column headers mean you can't reliably train a model on it. PAD is out because you're on a cloud flow.
Option 1: Parse the docx as XML (no extra licensing)
Since it's a cloud flow you can get the file content directly from SharePoint, then extract the table rows by parsing the underlying Open XML. The steps:
1. Get file content using path (SharePoint)
2. The .docx is a zip — use a Compose action to work with the raw XML or call an Azure Function to unzip and extract the document.xml
3. In document.xml, tables are in `<w:tbl>` elements, rows in `<w:tr>`, cells in `<w:tc>`, and text in `<w:t>`
4. Use the xpath expression in a Parse XML action to pull out cell values row by row
This works well for a table at the bottom of the document even with inconsistent headers since you're reading raw cell positions not named columns.
Option 2: Azure AI Document Intelligence (most reliable for your case)
Since the headers aren't consistent, Document Intelligence's prebuilt Layout model is actually the better fit than a custom-trained AI Builder model. It reads tables by position rather than header name, returns each cell with its row and column index, and works on any Word or PDF file without training. Requires an
Azure subscription but the pay-as-you-go pricing is low for occasional document processing.
For a one-time or low-volume trigger like a SharePoint modified file, the XML approach is the cheapest. For anything more than a few documents per day,
Document Intelligence is cleaner and more maintainable.
Which direction fits your volume and Azure access?
Best regards,
Valantis
✅ If this helped solve your issue, please Accept as Solution so others can find it quickly.
❤️ If it didn’t fully solve it but was still useful, please click “Yes” on “Was this reply helpful?” or leave a Like :).
🏷️ For follow-ups @Valantis.
📝 https://valantisond365.com/