Question

FeatureMerger turns nulls/missing into asian characters

1 year ago
February 12, 2024
6 replies
143 views

takanaka
Contributor
9 replies

Hi,

Unfortunately i cannot provide the test data due to confidential data. I also cannot encode/dummify the data because then the issue no longer occurs. It is likely related to the large amount of data being processed. What I can include is the image you see here.

My question is: how can the following happen? Is this a bug?

I merge two flows, both have a lot of attributes. Requestor is 1.9m features. Supplier is 6k features. Supplier has only in ~1% of cases a value for attributes ‘X’ and ‘Y’ - the other 99% is <missing>. But supplier also has a lot of other data and attributes.

After merging, for attributes x and y a lot more values are found. They are chinese or japanese characters. They are supposed to be numbers or english words.. Also I think they are getting also values from other attributes entirely, but then the actual value instead of random chinese or japanese characters. How is this happening?

I am sure that: the attribute I test is the same (I duplicated the statisticscalculator). I am sure that: the attribute doesnt exist in the requestor flow.

My FME version is 2023.0.0.3; build 23319.

+35

virtualcitymatt
Celebrity
1849 replies
1 year ago
February 12, 2024

Very interesting indeed - when you say “both have lots of attributes” how many are we talking here?

Does the same issue happen with the FeatureJoiner?

Also, I’m not sure if this helps, but can you use a Null attribute mapper to map the missing values in those to null? You can then change the Ignore Nulls setting from No to Yes in the FeatureMerger - This should keep the same behaviour as if they were missing.

takanaka
Author
Contributor
9 replies
1 year ago
February 13, 2024

virtualcitymatt wrote:

Hi Matt, yesterday I also discovered that the fix is to use ‘Ignore Nulls = yes’ in the ‘FeatureMerger’.

Based on your comment i tried the NullAttributeMapper, this also fixes the issue while Ignore Nulls = no.

Also I tested featurejoiner (inner mode): the issue does not occur in this way. So definitely it is something in the FeatureMerger I imagine.

Requestor (left) has only 9 attributes of which 7 exposed. Supplier (right) has 17 attributes of which 13 exposed. Maybe not a whole lot, but at least half of attributes are 100% filled (Some are strings up to 100chars for example).

andrewkb
Contributor
2 replies
11 months ago
June 5, 2024

Hi, I had this issue of Chinese characters appearing in my fields. I narrowed it down to a Tester in my case. Now a Tester shouldn’t change data, yet some features were coming out corrupted.

I think it happened due to the fact that I had rearranged field order a lot in the workbench. I got data values from one field appearing in another field without any attribute mapping, as well as Chinese characters and empty strings. It’s a bug.

I fixed it by tracing through feature caches to find where it was happening, then replaced the Tester with TestFilter. I also now try not to mess with field order too much, just reorder at the last step.

I was using 2022.0.

+35

virtualcitymatt
Celebrity
1849 replies
11 months ago
June 5, 2024

andrewkb wrote:

Hi, I had this issue of Chinese characters appearing in my fields. I narrowed it down to a Tester in my case. Now a Tester shouldn’t change data, yet some features were coming out corrupted.

I was using 2022.0.

Crazy - good finding. If you can share the reproduction I can test if it’s still an issues in FME 2024 and submit a bug report for them

+18

DanAtSafe
Safer
326 replies
11 months ago
June 5, 2024

Hi @andrewkb This problem was fixed in FME 2022 and 2023. Please submit a ticket if you still see this issue with FME 2023.2 or 2024.0+

andrewkb
Contributor
2 replies
11 months ago
June 11, 2024

So I thought I fixed this but sadly no. I kept getting nulls and Chinese chars immediately after my file geodatabase reader. I rebuilt the entire workbench and got the same issue happening.

Found from another post this is a bug with 2022.0 Bulk Mode. Fix is to use the Decelerator after your reader, then everything works.

Lucky for me I found this in time, I was close to needing someone to talk me down from a ledge… ha ha

https://community.safe.com/general-10/errors-every-10-000-records-in-fme-desktop-2022-0-how-to-fix-this-20690

ttps://docs.safe.com/fme/2021.0/html/FME_Desktop_Documentation/FME_Workbench/Workbench/Improving-Performance-Bulk-Mode.htm

Reply

Rich Text Editor, editor1

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

FeatureMerger turns nulls/missing into asian characters