Solved

Extracting a link from HTML: XQuery issue

2 years ago
August 30, 2022
5 replies
31 views

+3

johnglick
Contributor
32 replies

I'm attempting to extract a link from HTML that looks something like this: <a class="email-link" href="https://app.myapp.com/results/a7696ced77454dd39d8241e79f981b3d">here</a>, using the XQuery extractor. The XQuery expression //a/@href seems to work in the online testers but not in FME. Any ideas?

+55

hkingsbury
Celebrity
1519 replies
2 years ago
August 30, 2022

Another option would be to use the StringSearcher with a regular expression:

https://rubular.com/r/EmlvM3VidvKFl3
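
For readers who cannot open the link, here is a minimal sketch of the regular-expression approach in Python. The pattern is an illustrative assumption, not the one saved at the Rubular link, and the names `HREF_PATTERN` and `find_hrefs` are made up for this example. It only handles double-quoted href values on <a> tags.

```python
import re

# Hypothetical pattern (NOT the one from the Rubular link):
# capture the double-quoted href value inside an <a ...> start tag.
HREF_PATTERN = re.compile(r'<a\b[^>]*?\bhref\s*=\s*"([^"]*)"', re.IGNORECASE)


def find_hrefs(html: str) -> list[str]:
    """Return every double-quoted href value found on <a> tags."""
    return HREF_PATTERN.findall(html)


sample = (
    '<a class="email-link" '
    'href="https://app.myapp.com/results/a7696ced77454dd39d8241e79f981b3d">here</a>'
)
print(find_hrefs(sample))
```

A regular expression like this is fine for a predictable snippet such as an email template, but it tends to break on unusual markup (single quotes, unquoted values), which is where a real parser is the safer choice.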

+3

johnglick
Author
Contributor
32 replies
2 years ago
August 30, 2022

hkingsbury wrote:

Another option would be to use the StringSearcher with a regular expression:

https://rubular.com/r/EmlvM3VidvKFl3

Certainly an acceptable workaround. I'm curious as to why the XQuery doesn't work. Any ideas?

+55

hkingsbury
Celebrity
1519 replies
2 years ago
August 30, 2022

johnglick wrote:

Certainly an acceptable workaround. I'm curious as to why the XQuery doesn't work. Any ideas?

I'll be honest, I've never used XQuery before, so I'm not much help there!

+20

debbiatsafe
Safer
648 replies
Best Answer
2 years ago
August 31, 2022

Hi @johnglick

With the XQuery expression //a/@href, the XMLXQueryExtractor rejects the feature, and the rejection message is "...can not serialize attribute node".

Searching this error led to this StackOverflow answer. It seems the error is caused by trying to serialize certain result types, here a bare attribute node. Using either of the two functions mentioned in the answer, data() or string(), on the attribute allows the XMLXQueryExtractor to complete successfully (e.g. //a/data(@href) or //a/string(@href)).
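
To make the distinction concrete: //a/@href selects the attribute node itself, while data(@href) and string(@href) return its value as plain text, which can be written out. The sketch below is only a rough Python analogy using the standard library's ElementTree, not what the XMLXQueryExtractor runs internally. The function name `href_values` is made up for illustration, and it assumes the input is well-formed XML.

```python
import xml.etree.ElementTree as ET


def href_values(xml_text: str) -> list[str]:
    """Return the string value of every href attribute on <a> elements.

    Analogous in spirit to //a/data(@href): the result is a list of
    plain strings rather than attribute nodes.
    """
    root = ET.fromstring(xml_text)  # assumes well-formed XML
    # iter("a") walks the root and all descendants named "a"
    return [a.get("href") for a in root.iter("a") if a.get("href") is not None]


snippet = (
    '<a class="email-link" '
    'href="https://app.myapp.com/results/a7696ced77454dd39d8241e79f981b3d">here</a>'
)
print(href_values(snippet))
```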

I'll note it is also possible to use the HTMLExtractor to extract URLs as an alternative to the XMLXQueryExtractor.

Use the XQuery expression //a/data(@href) in the XMLXQueryExtractor, or the CSS selector a[href] in the HTMLExtractor.
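
As a rough illustration of what selecting a[href] and reading the href attribute amounts to, here is a small Python sketch using the standard library's html.parser. It is an analogy under my own assumptions, not how the HTMLExtractor is implemented; the names `HrefCollector` and `collect_hrefs` are hypothetical.

```python
from html.parser import HTMLParser


class HrefCollector(HTMLParser):
    """Collect href values from <a> start tags (similar in spirit to a[href])."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag != "a":
            return
        for name, value in attrs:
            # Skip valueless attributes such as <a href>.
            if name == "href" and value is not None:
                self.hrefs.append(value)


def collect_hrefs(html):
    """Parse the HTML text and return all href values found on <a> tags."""
    parser = HrefCollector()
    parser.feed(html)
    parser.close()
    return parser.hrefs


page = '<p>Click <a class="email-link" href="https://example.com/r/1">here</a><br><a name="top">x</a>'
print(collect_hrefs(page))
```

Unlike the XML route, this kind of parser does not need the input to be well-formed XML, so unclosed tags such as <br> are tolerated.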

+3

johnglick
Author
Contributor
32 replies
2 years ago
September 1, 2022

Thanks! I never realized it was possible to put a parameter other than "Whole" or "Part" in the Tag Part/HTML Attribute area of the HTMLExtractor. Thanks to your input, I can now eliminate a transformer from my workflow!
