Question

HTML reader

7 years ago
June 21, 2018
3 replies
2 views

boubcher
Contributor
212 replies

Hello, I am looking to extract data from a public website.

I used the HTML Table reader, but I am not getting the expected values in the table.

website: https://www.stats.gov.sa/en/160

redgeographics
Celebrity
3643 replies
7 years ago
June 21, 2018

That table is generated dynamically through JavaScript, so the HTML table reader isn't able to access the cell contents. If you want to get hold of the Excel files, you could probably read them directly: just plug the URL into an Excel reader. The URL pattern looks fixed, so that should be OK.
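
One way to confirm this kind of diagnosis is to look at the raw HTML the server returns, before any script runs, and count the table cells that actually contain text. The following is only a rough sketch using Python's standard library; the names `CellCounter` and `count_filled_cells` are made up for illustration, and the sample strings are invented rather than taken from the site.

```python
from html.parser import HTMLParser


class CellCounter(HTMLParser):
    """Counts <td>/<th> cells in static HTML that contain non-blank text."""

    def __init__(self):
        super().__init__()
        self._in_cell = False
        self._cell_has_text = False
        self.filled_cells = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("td", "th"):
            self._in_cell = True
            self._cell_has_text = False

    def handle_data(self, data):
        # Text inside <script> is ignored because it is not inside a cell.
        if self._in_cell and data.strip():
            self._cell_has_text = True

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._in_cell:
            if self._cell_has_text:
                self.filled_cells += 1
            self._in_cell = False


def count_filled_cells(html_text):
    """Return how many table cells in html_text hold visible text."""
    parser = CellCounter()
    parser.feed(html_text)
    parser.close()
    return parser.filled_cells


# Invented samples: a table shipped in the HTML vs. one filled in by a script.
static_page = "<table><tr><td>2017</td><td>Report</td></tr></table>"
scripted_page = '<table id="t"></table><script>fill("t")</script>'
print(count_filled_cells(static_page))
print(count_filled_cells(scripted_page))
```

If the count for the downloaded page source is zero while the browser shows a full table, the cells are most likely being filled in client-side, which would explain an empty result from a reader that only sees the static HTML.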

boubcher
Author
Contributor
212 replies
7 years ago
June 21, 2018

You are right.

The only problem is that we would need to collect all those links manually. I was looking to extract all those links automatically. Any suggestions?

redgeographics
Celebrity
3643 replies
7 years ago
June 22, 2018

I'm afraid not. On closer inspection, it looks like there are pretty big differences from year to year in how the data is offered. The file URLs don't actually appear in the page source, so scraping it isn't going to work either.
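
For anyone who wants to run the same check on another page, here is a minimal sketch of searching a page's source for links to spreadsheet files. It uses only the Python standard library; `LinkCollector`, `find_spreadsheet_links`, and the `example.org` sample are hypothetical names and data for illustration, not anything from the site in question.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

SPREADSHEET_EXTENSIONS = (".xls", ".xlsx")


class LinkCollector(HTMLParser):
    """Collects absolute URLs of <a href> links that point at spreadsheets."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the page URL before checking them.
        absolute = urljoin(self.base_url, href)
        if urlparse(absolute).path.lower().endswith(SPREADSHEET_EXTENSIONS):
            self.links.append(absolute)


def find_spreadsheet_links(html_text, base_url):
    """Return spreadsheet links found in html_text, in document order."""
    collector = LinkCollector(base_url)
    collector.feed(html_text)
    collector.close()
    return collector.links


# Invented sample page: one spreadsheet link and one ordinary link.
sample = '<a href="/files/a.xlsx">A</a><a href="page.html">B</a>'
print(find_spreadsheet_links(sample, "https://example.org/en/160"))
```

An empty list for the downloaded source, as seems to be the case here, suggests the links are only added after the page's scripts run, so a static scrape has nothing to find.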
