Hi all,
Tearing my hair out on this one. I have a list of words (e.g. "Tom", "John" and "Harry") and need to find a streamlined way to extract the page number(s) on which each word appears.
The list of words is fixed and could be hardcoded, but ideally it would be read from a column in Excel, with the resulting page numbers (output) written to a column in that worksheet, separated by commas, in the row matching the search string.
The files to be searched are most often PDFs (always digitally created, so no OCR is needed) but occasionally include Word documents.
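To make the logic I'm after concrete, here is a rough Python sketch of just the matching step. It assumes the text of each page has already been extracted into a list of strings (for example with a PDF library such as pypdf, which is an assumption on my part and not part of my current setup), and the name find_pages and the sample page texts are made up for illustration. Reading the words from Excel and writing the results back are left out.

```python
import re

def find_pages(page_texts, words):
    """Map each word to a comma-separated string of 1-based page numbers."""
    results = {}
    for word in words:
        # Whole-word, case-insensitive match; re.escape guards special characters.
        pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
        pages = [
            str(number)
            for number, text in enumerate(page_texts, start=1)
            if pattern.search(text)
        ]
        results[word] = ", ".join(pages)
    return results

# Hypothetical page texts standing in for what a PDF library would return,
# one string per page.
sample_pages = [
    "Tom met John at the station.",
    "Nobody of note appears here.",
    "Harry and Tom left early.",
]

print(find_pages(sample_pages, ["Tom", "John", "Harry"]))
# {'Tom': '1, 3', 'John': '1', 'Harry': '3'}
```

Each value is already in the comma-separated form I'd want in the output column, and a word that never appears gets an empty string. The whole-word matching is my own choice here so that "Tom" does not match "tomato"; a plain substring search would also be acceptable if that is easier to build.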
Any ideas? I see that @antoinec helped with something slightly similar, but that required OCR and extracting the pages themselves. https://powerusers.microsoft.com/t5/AI-Builder/Search-specific-text-on-a-PDF-and-get-the-page-number-it-exists/m-p/1462430?lightbox-message-images-1463051=414173i90FB0B28999BC443#M1182