Skip to main content

Notifications

Community site session details

Community site session details

Session Id : jdcfzasOaD54BxZer1YIdR
Power Automate - Power Automate Desktop
Answered

Power Automate - Webscraping help

Like (0) ShareShare
ReportReport
Posted on 4 May 2022 21:41:20 by

Hi MS Community

 

I'm new to Power Automate, and am trying to create a flow that will extract text data from each item on a webpage (web scraping project). Only problem is that the data is not available on the first webpage "parent page". To get to the item data, I must click on an item from the parent page which then takes me to another webpage (same web-site, but item specific) "item page" where my desired data is stored. I've seen the tutorials on how to extract data from webpages, but am having trouble finding resources on how to create a program which: clicks on each item, extracts data from that item page, returns to parent page, and then repeats for all items. Any direction / guidance would be greatly appreciated.

  • Agnius Bartninkas Profile Picture
    10,045 Most Valuable Professional on 19 Oct 2023 at 06:17:37
    Re: Power Automate - Webscraping help

    Have you actually read the reply by @Pavel_NaNoi above? It's explained there, but your screenshot does not follow his suggestion. Please modify your extraction so it matches his and then it should work.

  • ishanjayn Profile Picture
    3 on 19 Oct 2023 at 06:08:32
    Re: Power Automate - Webscraping help

    ishanjayn_0-1697695675485.png

    I'm unable to extract the urls, could you help me with that 

     

  • Verified answer
    Pavel_NaNoi Profile Picture
    1,072 on 05 May 2022 at 10:04:33
    Re: Power Automate - Webscraping help

    Alright this would usually be a simple problem you just got a bit unlucky with the website, here's what to do:

    To get the desired effect, do the following:

    Use the "Extract data from web page" action, select at least two items, get their text for example and it should look like this:

    Pavel_NaNoi_1-1651744446428.png

    once that is done go into the advanced options (bottom left corner of the panel that you see on the right in this image)

    and here change the values seen as in the following image:

    Pavel_NaNoi_2-1651744769803.png

    should look like this:

    Pavel_NaNoi_3-1651744799728.png

    this allows you to get every single hyperlink to every product that was extracted from this page, so you can simply use a for loop with a "go to web page" action followed by another "extract data from web page" action to extract whatever data / text you want.

     

    In essence the problem here was that the webpage was just too imbued and the selector had a hard time finding what it needed.

     

    Hope this helps.

  • Community Power Platform Member Profile Picture
    on 04 May 2022 at 21:44:46
    Re: Power Automate - Webscraping help

    Example of Parent Page, and a subsequent item page:

     

    Parent Page Link: https://weedmaps.com/brands/stiiizy/products

    Item Page Link: https://weedmaps.com/brands/stiiizy/products/stiiizy-battery-starter-kit?origin=brand&boost%5Blisting_wmid%5D=952113368&boost%5Bretailer_type%5D=dispensary

     

     

Under review

Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.

Helpful resources

Quick Links

Understanding Microsoft Agents - Introductory Session

Confused about how agents work across the Microsoft ecosystem? Register today!

Markus Franz – Community Spotlight

We are honored to recognize Markus Franz as our April 2025 Community…

Kudos to the March Top 10 Community Stars!

Thanks for all your good work in the Community!

Leaderboard

#1
WarrenBelz Profile Picture

WarrenBelz 146,670 Most Valuable Professional

#2
RandyHayes Profile Picture

RandyHayes 76,287 Super User 2024 Season 1

#3
Pstork1 Profile Picture

Pstork1 66,004 Most Valuable Professional

Leaderboard
Loading started