Question

Extract informations from a web page

  • 29 September 2022
  • 3 replies
  • 2 views

Badge

Hello everyone,

I have to extract information from a web page that shows a map with a database connected to a list of projects in the area, could anyone tell me how to do it?

 

Here you are a print screen of the web pagePage web


3 replies

Userlevel 5
Badge +25

Can you share the url?

It really depends on how the web page is set up. If it's all in relatively simple HTML/Javascript it might be possible, but many web maps use more complex libraries.

Badge

this one --> https://prestations.vd.ch/pub/actiscamac/101091/A5797K-JDGTM7/results

Userlevel 5
Badge +25

Is this a portal that requires an account? I get a "session expired" page?

 

Either way, if you right-click the page in your browser and choose "view source" (or something to that effect, depending on your browser/language) you should see the page source. If you see stuff in there that resembles coordinates you're probably going to be able to scrape it.

 

There's an HTML reader and then you can use the HTMLExtractor transformer, or the StringSearcher, to try and get the information out of it. It probably won't be easy and it helps if you have some background knowledge about HTML and whatever scripting language that page uses.

Reply