Skip to main content
Question

web scraper for noobs

  • March 6, 2019
  • 3 replies
  • 72 views

http://www.sharkattackdata.com/place/australia how can i scrape this page and get the table as a CSV, Im a total noob and ive tried watching youtube but the tutorials are still above my level and i cant seem to apply them to my job. I know i need a HTTP Caller,

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

3 replies

itay
Supporter
Forum|alt.badge.img+18
  • Supporter
  • March 6, 2019

Hi @teeks82,

I suspect you will have to dive into css to help you extract the data from the web page.

Normally the HTML Table Reader should be able to do that, but that depends on how the web page is made and after trying it myself I could not get the correct information.

Another option is using the HTMLExtractor transformer, and for that you need to dive into css selectors.

Hope this helps.

Itay


gazza
Contributor
Forum|alt.badge.img+6
  • Contributor
  • March 7, 2019

The data behind the table is available in a json file at http://www.sharkattackdata.com/api/attacks?country=australia you should be able to use a json reader to access it directly.


itay
Supporter
Forum|alt.badge.img+18
  • Supporter
  • March 7, 2019

The data behind the table is available in a json file at http://www.sharkattackdata.com/api/attacks?country=australia you should be able to use a json reader to access it directly.

A lot easier....