Question

HTMLExtractor using basic login

  • April 19, 2018

I'm very new to FME and am trying to extract some data from a webpage. The page requires a basic login. I can use the HTTPFetcher to connect to the page, but how do I then extract the table elements from the response body?

 

 


2 replies

david_r
Celebrity
  • 8394 replies
  • April 19, 2018

I would recommend that you use the HTTPCaller to retrieve the HTML document; it's much easier to specify authentication parameters there. You can then send the resulting HTML to the HTMLExtractor in an attribute. Example HTTPCaller setup for basic auth:

In the HTMLExtractor, read the contents from the _response_body attribute, e.g.:
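
For readers who want to see the same two steps outside the workbench (for instance in a PythonCaller), here is a rough sketch in plain Python. It is not what HTTPCaller or HTMLExtractor do internally, just an illustration of the idea: send a basic-auth request, then pull the table cells out of the response body. The names `basic_auth_header`, `fetch_html`, `TableParser` and `extract_table_rows` are made up for this example, and the parser assumes simple, non-nested tables.

```python
import base64
import urllib.request
from html.parser import HTMLParser


def basic_auth_header(username: str, password: str) -> str:
    """Build the value of an HTTP Basic 'Authorization' header."""
    raw = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def fetch_html(url: str, username: str, password: str) -> str:
    """Fetch a page protected by basic auth and return the body as text."""
    request = urllib.request.Request(url)
    request.add_header("Authorization", basic_auth_header(username, password))
    with urllib.request.urlopen(request) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


class TableParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row.

    Assumption: tables are not nested inside each other.
    """

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
            self._cell = None
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def extract_table_rows(html: str):
    """Return all table rows found in the HTML as lists of cell text."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    return parser.rows


# Usage on a literal string standing in for the response body:
sample = "<table><tr><th>Name</th><th>Qty</th></tr><tr><td>A</td><td> 1 </td></tr></table>"
print(extract_table_rows(sample))
```

In a real run you would pass the result of `fetch_html(...)` (or the value of _response_body) to `extract_table_rows` instead of the sample string.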


danilo_fme
Celebrity
  • 2077 replies
  • April 19, 2018

Nice explanation, @david_r!