Question

CSS selector

  • 20 October 2019
  • 3 replies
  • 17 views

Userlevel 1
Badge +11

Hi

I don't understand how to use CSS selectors to parse this HTML page:

 

http://www.comune.torino.it/albopretorio/alboordinanze.shtml

to obtain as output a table with the 7 values (see the picture).

 

I mean, the goal is to parse the HTML page so that all the values marked 1 go in the first column, the values marked 2 in the second column, and so on.

Thanks

 

Francesco

 


3 replies

Userlevel 2
Badge +17

Hi Francesco, you can populate all the rows (i.e. <tr> elements) of the HTML table into a list attribute with the HTMLExtractor and explode it with the ListExploder, like this.

Then parse each row to extract your desired columns with some transformers, e.g.

See also the attached workspace example to learn more: htmlextractor-example.fmw (FME 2019.1.3)
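For readers who want to see the same idea outside FME, here is a minimal Python sketch of the two steps: collect every <tr> row, then split each row into its cells. It is not what the HTMLExtractor does internally. The class name `TableRowParser`, the helper `parse_rows`, and the sample table are all made up for illustration, and the real page may be structured differently.

```python
from html.parser import HTMLParser


class TableRowParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def parse_rows(html):
    """Return the table rows found in `html` as lists of cell strings."""
    parser = TableRowParser()
    parser.feed(html)
    parser.close()
    return parser.rows


# Hypothetical sample data, not taken from the real page.
SAMPLE_HTML = """
<table>
  <tr><th>Number</th><th>Subject</th><th>Date</th></tr>
  <tr><td>1</td><td>Road closure</td><td>2019-10-18</td></tr>
  <tr><td>2</td><td>Parking <b>ban</b></td><td>2019-10-19</td></tr>
</table>
"""

rows = parse_rows(SAMPLE_HTML)
for row in rows:
    print(row)
```

Each inner list corresponds to one exploded list element in the workspace, and each position in the list to one output column.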

Hope this helps.

Userlevel 1
Badge +11

Hi Takashi

 

Thanks for the help.

 

I've also tried with this site, which appears to be very similar, but I'm not able to obtain any data:

 

https://www.comune.roma.it/servizi2/gedalbonet/ricerca.action

 

Can you help me?

 

Thanks

 

Francesco

Userlevel 1
Badge +11

Hi

I have a new question.

What is the correct syntax for the HTMLExtractor (if it is possible) to get the result in one row?

 

I tried this, but the result isn't optimal (a lot of duplicates).

 

I don't know if it's possible to put several CSS selectors on the same line, like this:

[id=titolo];[id=riga_link];[id=descrizione]
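A note on the syntax: in standard CSS a group of selectors is separated by commas, not semicolons, e.g. `#titolo, #riga_link, #descrizione`. Whether the HTMLExtractor accepts such a group in a single row is not confirmed in this thread. Independently of FME, the "one row per record" grouping can be sketched in Python as below. This assumes the three ids repeat once per record in document order; the class `FieldParser`, the helper `parse_records`, and the sample HTML are hypothetical.

```python
from html.parser import HTMLParser

FIELDS = ("titolo", "riga_link", "descrizione")  # ids taken from the question
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}  # tags with no end tag


class FieldParser(HTMLParser):
    """Group the text of elements with the given ids into one row per record."""

    def __init__(self, fields):
        super().__init__()
        self.fields = fields
        self.records = []
        self._current = {}   # field id -> text for the record being built
        self._field = None   # id of the element currently being read
        self._depth = 0      # nesting depth inside that element
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self._field is not None:
            self._depth += 1
            return
        element_id = dict(attrs).get("id")
        if element_id in self.fields:
            if element_id in self._current:
                self.flush()  # a repeated id starts the next record
            self._field, self._depth, self._text = element_id, 1, []

    def handle_data(self, data):
        if self._field is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if self._field is None or tag in VOID_TAGS:
            return
        self._depth -= 1
        if self._depth == 0:
            self._current[self._field] = " ".join(" ".join(self._text).split())
            self._field = None

    def flush(self):
        """Emit the record being built, with "" for any missing field."""
        if self._current:
            self.records.append([self._current.get(f, "") for f in self.fields])
            self._current = {}


def parse_records(html, fields=FIELDS):
    parser = FieldParser(fields)
    parser.feed(html)
    parser.close()
    parser.flush()
    return parser.records


# Hypothetical sample data, not taken from the real page.
SAMPLE_HTML = """
<div><span id="titolo">Ordinance 1</span>
     <a id="riga_link" href="/doc/1">Details</a>
     <p id="descrizione">Road closure<br>in the city centre</p></div>
<div><span id="titolo">Ordinance 2</span>
     <a id="riga_link" href="/doc/2">Details</a>
     <p id="descrizione">Parking ban</p></div>
"""

records = parse_records(SAMPLE_HTML)
for record in records:
    print(record)
```

Collecting the three fields per record, instead of extracting each selector into its own list and combining the lists afterwards, is one way to avoid the duplicated rows.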

 
