Hi,
I am setting up one framework based on Flow + Sharepoint for following purposes.
- Fully automatically detect Word document modification in specific folder of one Sharepoint site document library.
- Fully automatically parse the content of the modified Word document to identify self-defined pattern and output the identified information.
Point 1 is simple and implemented.
For point 2, the solution is still open in my mind but some ideas are coming as below.
- No idea on whether there are some Sharepoint Apps capable to parse Word document content.
- python-docx should be able to parse one Word document for my purpose. However, it is difficult to integrate with enterprise Sharepoint for full automatic purpose.
- In case locally running one set of python programs based on python-docx module and automatically fetching the Word document from Sharepoint site, the obstacle is the authentication to the cloud.
- Quite open to directly embed one piece of python code to one Flow connector for Word document content parsing.
- When attempting to send one file to one local location, I failed with status "BadGateway".
Welcome any idea for my point 2. Thanks a lot!
The same topic is also posted in https://answers.microsoft.com/en-us/msoffice/forum/msoffice_word-mso_other-mso_o365b/automatically-parse-office-word-document-located/ff470774-1102-4c6d-8142-df75264f7bc6