I am new user here, trying to develop a flow to extract the table data from this web page - " https://seoulsemicon.com/en/support/documentlibrary " unable to extract the Image URL - basically am looking for PDF link Extraction option.
Can someone guide or provide inputs, appreciate all your support in advance. Thanks.,
Hello,
I don't think that site has the PDF download links in the HTML. When pressing the download button It seems to trigger Javascript OnClick that does the download. So you can't get the PDF link extracted with PAD.
I was able to get the url when downloading the pdf and then looking at the response and using the path as the endpoint.
Like so:
Then when you use that as a link it downloads a file
File downloaded:
Rename it to have .pdf extention. Then it should be the correct pdf.
But since that ending part seems to be UUID (a97a9ccf-1ed1-4000-996d-eaba6ee5ca88) you cant really know it before you have downloaded the file. So you would need to download each file to get the URL. And even if it would be okay for you that you download each file to get the URL I am not sure how to do that other than make PAD flow which goes into the browsers Inspect -> Network -> Response and extract the path. So at least for my knowledge not an easy/quick task to do.
Was this reply helpful?YesNo
Under review
Thank you for your reply! To ensure a great experience for everyone, your content is awaiting approval by our Community Managers. Please check back later.