Question

How to extract HTML data from multiple tabs off a webpage

  • 18 January 2019
  • 1 reply
  • 12 views

Hi

I am extremely new to this but I have been asked to learn HTML/CSS/JAVA and FME and come up with a solution.

 

I am trying to extract data from the tabs on https://travel.gc.ca/destinations/bangladesh and write them into a single list in HTML that I can then use later.

 

In my workbench I have it extracting and writing to Excel and and HTML file. Currently it writes all the tabs to excel (although it looks like jargen) but I can only do one tab at a time in the HTML?

 

Is there a way to one remove all the HTML "code" from the excel and two write multiple tabs into the single HTML file?

 

thank you


1 reply

Userlevel 2
Badge +17

It seems that "details" tag with "id" attribute contains each tab content. If my observation was correct, the HTML extractor with this setting would extract every tab content as HTML fragment and populate them into a list attribute. You can them extract each fragment with the ListExploder.

Reply