Skip to main content
Question

Can FME find the host location of a website?

  • September 7, 2016
  • 5 replies
  • 20 views

bubblebeb
Contributor
Forum|alt.badge.img+6

I am wanting to feed in a number of web urls and I would like to find the host / server location (and IP address ideally) for the sites.

I seem to remember seeing this done at a conference on FME.

Thanks

5 replies

david_r
Celebrity
  • September 7, 2016

Maybe you could use something like this service: http://ip-api.com/docs/

To split up an URL into the different bits and pieces you could use the "urlparse" Python module. It is mentioned in this old post, but unfortunately the code samples have been a bit mangled by an upgrade in the forum software: https://knowledge.safe.com/questions/2581/getting-parameters-from-a-url.html


davideagle
Contributor
Forum|alt.badge.img+21
  • Contributor
  • September 7, 2016

Hi Seb,

Perhaps you could try our custom transformer the IPGeocoder to do this.

https://hub.safe.com/transformers/ipgeocoder


bubblebeb
Contributor
Forum|alt.badge.img+6
  • Author
  • Contributor
  • September 7, 2016

Many thanks both 1spatialdave and david_r.

I have gone with the ip-api.com version as it gives more detail on the ISP.

Many thanks

Seb


david_r
Celebrity
  • September 8, 2016
david_r wrote:

Maybe you could use something like this service: http://ip-api.com/docs/

To split up an URL into the different bits and pieces you could use the "urlparse" Python module. It is mentioned in this old post, but unfortunately the code samples have been a bit mangled by an upgrade in the forum software: https://knowledge.safe.com/questions/2581/getting-parameters-from-a-url.html

I just saw this on their website:

 

Our system will automatically ban any IP addresses doing over 150 requests per minute.

 

 

If that might be an issue, you could use a Decelerator to avoid hitting the limit, e.g. this will limit yourself to maximum 120 lookups per minute:

 

 

 

 


bubblebeb
Contributor
Forum|alt.badge.img+6
  • Author
  • Contributor
  • September 8, 2016
david_r wrote:

Maybe you could use something like this service: http://ip-api.com/docs/

To split up an URL into the different bits and pieces you could use the "urlparse" Python module. It is mentioned in this old post, but unfortunately the code samples have been a bit mangled by an upgrade in the forum software: https://knowledge.safe.com/questions/2581/getting-parameters-from-a-url.html

 

Thanks david_r

 

I spotted that as well. I don't think I will be hitting it with 150 requests a minute.

 

 

Thanks

 

Seb

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

 
Cookie settings