Question

How to use the CSS selector in HTML extractor

2 years ago
October 25, 2022
4 replies
177 views

+12

checcosisani
Contributor
66 replies

I would like to extract some information from a website. The information is inside p tags within a div with class paragraphs_item__body_paragraph_bundle.

the website is

https://www.padovanet.it/notizia/20221025/strade-chiuse

See picture below

thx for support

webscraping Francesco

+50

geomancer
Evangelist
886 replies
2 years ago
October 26, 2022

.paragraphs_item__body_paragraph_bundle p

This selects all p elements that are descendants of an element with the class paragraphs_item__body_paragraph_bundle (see the HTMLExtractor and CSS Selector Reference documentation).
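To show what the descendant selector matches outside FME, here is a minimal Python sketch using only the standard library. It is not how the HTMLExtractor works internally. The class DescendantParagraphs, the function select_paragraphs and the SAMPLE markup are made up for illustration, and the sketch assumes well-formed HTML in which every p has a closing tag.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they are not tracked on the stack.
VOID = {"area", "base", "br", "col", "embed", "hr", "img",
        "input", "link", "meta", "source", "track", "wbr"}


class DescendantParagraphs(HTMLParser):
    """Collect the text of every <p> that has an ancestor with a given class."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.stack = []            # one (tag, has_class) entry per open element
        self.capture_depth = None  # stack depth at which the captured <p> opened
        self.buffer = []
        self.results = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID:
            return
        classes = (dict(attrs).get("class") or "").split()
        # Only ancestors count, so check the stack before pushing this element.
        inside = any(has_class for _, has_class in self.stack)
        if tag == "p" and inside and self.capture_depth is None:
            self.capture_depth = len(self.stack)
            self.buffer = []
        self.stack.append((tag, self.class_name in classes))

    def handle_endtag(self, tag):
        if tag in VOID or not self.stack:
            return
        self.stack.pop()
        if self.capture_depth is not None and len(self.stack) == self.capture_depth:
            self.results.append("".join(self.buffer).strip())
            self.capture_depth = None

    def handle_data(self, data):
        if self.capture_depth is not None:
            self.buffer.append(data)


def select_paragraphs(html, class_name):
    """Rough equivalent of the selector '.class_name p' for well-formed HTML."""
    parser = DescendantParagraphs(class_name)
    parser.feed(html)
    parser.close()
    return parser.results


# Hypothetical sample markup, not taken from the real page.
SAMPLE = """
<div class="paragraphs_item__body_paragraph_bundle other">
  <p>First paragraph</p>
  <section><p>Second paragraph with <strong>bold</strong> text</p></section>
</div>
<p>Paragraph outside the div</p>
"""

print(select_paragraphs(SAMPLE, "paragraphs_item__body_paragraph_bundle"))
```

The second p is matched even though it sits inside a section, because the space in the selector means "any descendant", not "direct child". The last p is skipped because it has no ancestor with that class.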

HTMLExtractor_strade_chiuse

+12

checcosisani
Author
Contributor
66 replies
2 years ago
October 26, 2022

thx !

+12

checcosisani
Author
Contributor
66 replies
9 months ago
September 28, 2024

Do you know if there is any way to extract the info inside a br tag?

I use this selector:

table > tbody > tr:nth-child(-n+10) > td:nth-child(2) > strong:nth-child(5)

but I can’t expose the info inside the br.

this is the website

https://cloud.urbi.it/urbi/progs/urp/ur1ME001.sto?DB_NAME=wt00038560&w3cbt=S&StwEvent=9100030

thx

Francesco

+50

geomancer
Evangelist
886 replies
9 months ago
September 30, 2024

You can just use an HTTPCaller, a few HTMLExtractors and ListExploders, and an AttributeSplitter.

Note that there is no ‘inside a <br> tag’, as <br> has no corresponding </br> tag. <br> signifies a line break (after <br> a new line is started). FME turns <br> into   (I found this out by just testing).
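The same idea (take the whole cell, then split it at the line breaks) can be sketched in Python with the standard library. This is only an illustration of the principle, not the contents of the attached workspace. BrSplitter, split_on_br and the CELL fragment are hypothetical names and data.

```python
from html.parser import HTMLParser


class BrSplitter(HTMLParser):
    """Split the text of an HTML fragment into pieces at every <br>."""

    def __init__(self):
        super().__init__()
        self.parts = [""]

    def handle_starttag(self, tag, attrs):
        # <br> has no content and no closing tag, so it only marks where
        # one piece of text ends and the next one begins.
        # Self-closing forms (<br/>, <br />) also end up here by default.
        if tag == "br":
            self.parts.append("")

    def handle_data(self, data):
        self.parts[-1] += data


def split_on_br(fragment):
    """Return the non-empty text pieces separated by <br> in the fragment."""
    parser = BrSplitter()
    parser.feed(fragment)
    parser.close()
    return [part.strip() for part in parser.parts if part.strip()]


# Hypothetical table cell, not copied from the real page.
CELL = "<td><strong>Title</strong><br>first line<br/>second line<br />third line</td>"

print(split_on_br(CELL))
```

The text that looks like it is "inside" the br is really text of the surrounding td, which is why selecting the parent element and splitting afterwards works.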

1 Attachments

Parse_urbi.it.zip
