Skip to main content
Archived

Allow use of Xpath in HTMLExtractor

Related products:FME Form

We use the HTML Extractor a fair bit when prototyping processes internally, but it'd be very useful to be able to use XPath in addition to CSS Selectors when extracting values from HTML.


Main benefit is that XPath allows for much more granular queries for selecting tags/text and can do plenty of things that CSS selectors cannot do.


I have fiddled a bit with using the various XML transformers to extract XPath, but with standard HTML websites rarely being valid XML, it gets trickier to troubleshoot extracting values from 'bad' sources

This post is closed to further activity.
It may be a question with a best answer, an implemented idea, or just a post needing no comment.
If you have a follow-up or related question, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

 
Cookie settings