Solved

Extracting webpage and map content from url


Badge

Hi,

I would like to extract the project listing as well as the corresponding Lat/Long from the map content together with its attributes such as Project Title, Location, Project Type, Status etc. from the project link

url: https://iaac-aeic.gc.ca/050/evaluations/exploration?active=true&showMap=true&document_type=project

Sample project url:https://iaac-aeic.gc.ca/050/evaluations/proj/80774

I have limited past exposure to htmlextractor, any assistance will be much appreciated.

icon

Best answer by jdh 10 July 2020, 00:09

View original

2 replies

Badge +22

 

Will get you most of the way there. You'll need to manipulate the lists to get usable attributes. The lat/long {0}.part can just be renamed, but you'll probably want the ListKeyValuePairExtractor custom transformer to get metadata attributes.
Badge

Thanks @jdh.

Reply