Skip to main content

I'm trying to automate my downloads of government environmental data. Unfortunately the zipped shapefiles change name because they put the date in the file name. I've found that they all end up in this location though: http://jncc.defra.gov.uk/files

So my current thinking is to get FME read this page and filter on the zip file name, then use httpcaller to download the right zip. When I go to this page in my we browser it displays a bit like a table - is there a way to get FME to read it as a table?

 

 

Try using the HTMLExtractor:

This will return a list zip_files{} with one element for every zip file referenced (HREF) on that page.


Try using the HTMLExtractor:

This will return a list zip_files{} with one element for every zip file referenced (HREF) on that page.

This looks promising, thanks. at the moment I'm getting HTMLExtractor: <type 'exceptions.RuntimeError'>: maximum recursion depth exceeded - is the recursion limit in my settings?


This looks promising, thanks. at the moment I'm getting HTMLExtractor: <type 'exceptions.RuntimeError'>: maximum recursion depth exceeded - is the recursion limit in my settings?

Which version of FME are you using? I tested with FME 2019 and it worked fine.


Which version of FME are you using? I tested with FME 2019 and it worked fine.

2018. I was using 32 bit but ill give 64 bit a try