Solved

Reading generic XML with variable content ?

8 years ago
February 15, 2017
5 replies
10 views

+29

lifalin2016
Contributor
576 replies

Hi,

How can I best read a generic (arbitrary) XML file without specifying a root tag or any other structure up front?

That is, I want the whole XML document returned as a single fragment.

Do I need to create special XRS or xfMap templates, or can this be done with the simpler Feature Paths?

Cheers

+50

redgeographics
Celebrity
3643 replies
8 years ago
February 15, 2017

If you just want a single fragment you might as well use a TextLine reader and set it to read the whole file at once. You'll then get a single feature with one attribute containing your entire XML file. You wouldn't be able to do much with it though, so are you sure that's what you want?

+17

itay
Supporter
1441 replies
8 years ago
February 15, 2017

One idea is to first analyze the XML for its tags, then pass that information to a second workspace or a FeatureReader that reads the XML.
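Outside FME, the "analyze the tags first" step could look roughly like the Python sketch below. It uses only the standard library; the function name `collect_element_paths` is made up for illustration and is not part of FME or of any workspace discussed here.

```python
import xml.etree.ElementTree as ET


def collect_element_paths(xml_text):
    """Return the distinct element paths in a document, in first-seen order."""
    root = ET.fromstring(xml_text)
    paths = []

    def walk(elem, prefix):
        # Build a simple slash-separated path such as /a/b/c.
        path = f"{prefix}/{elem.tag}"
        if path not in paths:
            paths.append(path)
        for child in elem:
            walk(child, path)

    walk(root, "")
    return paths


if __name__ == "__main__":
    print(collect_element_paths("<a><b/><b><c/></b></a>"))
```

Note that for namespaced documents ElementTree reports tags as `{namespace-uri}name`, so the paths would need some cleanup before being handed to another tool.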

takashi
7715 replies
Best Answer
8 years ago
February 16, 2017

Hi @lifalin2016, the XML reader can also be used.

When adding the reader, select Single Merged Feature Type under Workflow Options in the Add Reader dialog, and set the reader parameters as below.

Configuration Type: Feature Paths
Feature Paths Configuration/Elements to Match: //*
Flatten Options: Uncheck the Enable Flattening checkbox

Here, //* (two slashes and an asterisk) matches the document root element, whatever its name.

If you read an XML document with these settings, the reader feature type outputs a single feature with an "xml_fragment" attribute that stores the entire XML document.

[Addition] /* (a single slash and an asterisk) seems to work too.
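For readers who want to see the same idea outside FME, here is a minimal Python sketch using only the standard library. It is not how the FME XML reader is implemented; it just parses a document without knowing the root element name and returns the whole thing as one record. The function name `read_as_single_fragment` is hypothetical, and the `xml_fragment` key is chosen only to mirror the attribute name mentioned above.

```python
import xml.etree.ElementTree as ET


def read_as_single_fragment(xml_text):
    """Parse any well-formed XML and return it as one record."""
    # fromstring returns the root element, whatever it is called.
    root = ET.fromstring(xml_text)
    return {
        "root_name": root.tag,
        # Serialize the root and everything below it back to text.
        "xml_fragment": ET.tostring(root, encoding="unicode"),
    }


if __name__ == "__main__":
    record = read_as_single_fragment("<inventory><item>1</item></inventory>")
    print(record["root_name"])
    print(record["xml_fragment"])
```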

+29

lifalin2016
Author
Contributor
576 replies
8 years ago
February 21, 2017

Thanks Takashi, it worked.

I needed to read multiple XML files from a ZIP package to make sure, before attempting a translation, that they don't contain any errors. Each XML file has its very own schema setup, hence the need to read them "generically".

Unfortunately, the XML files in question are generated by an external tool outside our control that clearly doesn't validate its own output, so we have to check them first rather than waste hours on an import that is bound to fail.

This approach may come in handy in other cases too :-)

Cheers

-- Cheers, Lars I.
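A pre-check like the one described above could also be sketched in plain Python, for comparison. This is only an illustration under stated assumptions: it checks well-formedness (not validity against a schema), it assumes the relevant files end in `.xml`, and the function name `find_malformed_xml` is hypothetical.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def find_malformed_xml(zip_source):
    """Return {file name: parser message} for XML members that fail to parse."""
    problems = {}
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            # Assumption: only members with an .xml extension matter.
            if not name.lower().endswith(".xml"):
                continue
            try:
                # No root element name is needed to parse the document.
                ET.fromstring(archive.read(name))
            except ET.ParseError as exc:
                problems[name] = str(exc)
    return problems


if __name__ == "__main__":
    # Build a small in-memory ZIP so the example runs on its own.
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("good.xml", "<a><b/></a>")
        archive.writestr("bad.xml", "<a><b></a>")
    buffer.seek(0)
    print(find_malformed_xml(buffer))
```

`zip_source` can be a path or a file-like object, so the same function works on a ZIP file on disk.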

+17

itay
Supporter
1441 replies
8 years ago
February 21, 2017

itay wrote:

an idea is to first analyze the xml for its tags and pass that to a second workspace or feature reader to read the xml.

note to myself: read carefully!!!
