Skip to main content
Question

How to extract HTML data from multiple tabs off a webpage

  • January 18, 2019
  • 1 reply
  • 123 views

Hi

I am extremely new to this but I have been asked to learn HTML/CSS/JAVA and FME and come up with a solution.

 

I am trying to extract data from the tabs on https://travel.gc.ca/destinations/bangladesh and write them into a single list in HTML that I can then use later.

 

In my workbench I have it extracting and writing to Excel and and HTML file. Currently it writes all the tabs to excel (although it looks like jargen) but I can only do one tab at a time in the HTML?

 

Is there a way to one remove all the HTML "code" from the excel and two write multiple tabs into the single HTML file?

 

thank you

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

1 reply

takashi
Celebrity
  • January 20, 2019

It seems that "details" tag with "id" attribute contains each tab content. If my observation was correct, the HTML extractor with this setting would extract every tab content as HTML fragment and populate them into a list attribute. You can them extract each fragment with the ListExploder.