SchemaScanner behaving differently inside custom transformer?

Question

Hello,Can anyone explain the difference shown below? Inside a standard workflow, the SchemaScanner captures the order of the attribute names correctly as they appear on the input file. However, when used inside of a custom transformer, the incoming attribute names are sorted alphanumerically. Does anyone have any ideas why this is happening?

fmelizard · Answer

As Spock would say, "Fascinating".Sadly, I know exactly why this is happening. And I also cannot see any easy or helpful way around it.  Let me explain and then perhaps some idea can emerge.The reason this ultimately is going on is that fundamentally, at translation run time, data flowing through FME does not have any concept of column order. The actual data rows that fly through and are processed do not themselves have any ordering of attributes.  The SchemaScanner itself works on those attributes that are on data features, and has no way of knowing what the original source ordering of those attributes was. So in the absence of that, it sorts them on output.And that was the best we could do, and how the first iterations of the SchemaScanner were delivered a year or so ago.  But customers like you didn't like this and wanted the original order preserved.  So the team dug in deep and made use of the fact that at design time, while you're working in Workbench, Workbench knows what the original order of the columns was:  So when you put down a SchemaScanner on the "main canvas", we quietly send that ordering information into the SchemaScanner as part of its configuration.  If you right click and copy a SchemaScanner transformer on a workbench canvas, and then paste into your favorite text editor, you'll see a line like this:  TEMPLATE_SCHEMA { IdNr,uint32,GPSTime,uint64,Latitude,real64,Longitude,real64 } Which we generated from the information on the canvas and passed on, via this hint line, to the FME translation engine so it can use it at translation run time to give a hint as to the preferred order of theattributes.What happens if you put the SchemaScanner inside a custom transformer?  Well, in a custom transformer we don't know what attributes are going to be coming in, and so inside a custom transformer we aren't able to show them, nor do we know what they are. (This is also true because a single custom transformer might be used in multiple places in the translation, where the schemas coming in could be quite different).Bottom line -- this means that a SchemaScanner inside a custom transformer won't have any TEMPLATE_SCHEMA hint, and so can only go for sorting the attributes when it kicks out a schema.I have a couple of ideas for working around this *for particular scenarios*, so if you're able to share the key parts of your workspace, or more details on thescenario you're solving, I'm game to explore the solution space.  The easiest of course is just to do the SchemaScanning outside of the custom transformer, if that is a possibility (and I apologize for that in advance...)Sorry, and hope this explanation helps.

SchemaScanner behaving differently inside custom transformer?

2 replies

Reply

Community Stats

Reply

Community Stats

Sign up

An FME Account is required to contribute

Login to the community

An FME Account is required to contribute

Scanning file for viruses.

This file cannot be downloaded