Solved

Extract data and hyperlink from web page

  • October 2, 2017
  • 3 replies
  • 119 views

boubcher
Contributor

Hello Guys

I am trying to extract data from a table in an HTML page. The table contains links to downloadable files.

This is the website:

https://www.stats.gov.sa/en/160

Best answer by redgeographics

I agree, the file doesn't have any actual content. I tried reading it both as an HTMLTable and with a plain text reader, but no luck.



takashi
Celebrity
  • 7843 replies
  • October 2, 2017

Hi @boubcher, if you could get the HTML document containing the table element and the URLs of the files with HTTPCaller, the HTMLExtractor might help you to extract the contents of the table. However, it's hard to think of a specific way unless we can see the HTML document.
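For readers outside FME, here is a minimal Python sketch of the same idea: given an HTML document that has already been fetched, collect the cell text and the link targets of each table row. It only helps if the table is actually present in the fetched HTML. The class and function names and the sample markup are hypothetical, chosen for illustration; this is not what HTMLExtractor does internally.

```python
from html.parser import HTMLParser


class TableLinkParser(HTMLParser):
    """Collect cell text and link targets for every <tr> in a document."""

    def __init__(self):
        super().__init__()
        self.rows = []          # one dict per row: {"cells": [...], "links": [...]}
        self._row = None        # row being filled, or None when outside a <tr>
        self._in_cell = False   # True while inside a <td> or <th>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = {"cells": [], "links": []}
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._row["cells"].append("")
        elif tag == "a" and self._in_cell:
            href = dict(attrs).get("href")
            if href:
                self._row["links"].append(href)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._in_cell = False

    def handle_data(self, data):
        # Only text inside a cell is kept; whitespace between tags is ignored.
        if self._in_cell and self._row is not None:
            self._row["cells"][-1] += data.strip()


def extract_rows(html):
    """Return a list of {"cells": [...], "links": [...]} dicts, one per table row."""
    parser = TableLinkParser()
    parser.feed(html)
    parser.close()
    return parser.rows


# Hypothetical sample markup, not taken from the page in question.
SAMPLE = (
    "<table>"
    "<tr><th>Report</th><th>File</th></tr>"
    '<tr><td>Report A</td><td><a href="/files/a.xlsx">Download</a></td></tr>'
    "</table>"
)

for row in extract_rows(SAMPLE):
    print(row["cells"], row["links"])
```

Relative links such as `/files/a.xlsx` would still need to be joined with the page URL (for example with `urllib.parse.urljoin`) before the files can be downloaded.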


takashi
Celebrity
  • 7843 replies
  • October 2, 2017


Oops, I overlooked that you had already posted the URL of the page.

 

The HTML page is created dynamically with JavaScript, and the document itself doesn't seem to contain the actual contents of the table. I'm afraid it's hard to get the contents unless you know the API used to access the download files.
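One rough way to confirm this kind of situation is to count the tags in the HTML that the server actually returns. The Python sketch below is a heuristic only: it flags a document that has `<script>` elements but no `<tr>` elements. The function names and the sample markup are hypothetical, and a page can of course lack a table for other reasons.

```python
from html.parser import HTMLParser


class TagCounter(HTMLParser):
    """Count how many times each start tag occurs in a document."""

    def __init__(self):
        super().__init__()
        self.counts = {}

    def handle_starttag(self, tag, attrs):
        self.counts[tag] = self.counts.get(tag, 0) + 1


def count_tags(html):
    counter = TagCounter()
    counter.feed(html)
    counter.close()
    return counter.counts


def looks_script_rendered(html):
    """Heuristic: scripts are present, but no table rows exist in the raw HTML."""
    counts = count_tags(html)
    return counts.get("tr", 0) == 0 and counts.get("script", 0) > 0


# Hypothetical shell page whose content would be filled in by JavaScript.
SHELL_PAGE = '<html><body><div id="app"></div><script src="app.js"></script></body></html>'
print(looks_script_rendered(SHELL_PAGE))
```

If the check comes back true, the data usually arrives through a separate request made by the script, which is why knowing that API matters.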

 


redgeographics
Celebrity
  • 3704 replies
  • Best Answer
  • October 2, 2017

I agree, the file doesn't have any actual content. I tried reading it both as an HTMLTable and with a plain text reader, but no luck.