Skip to main content

Hello,

 

After struggling with several transformers, I can't figure out how to simply remove duplicates from 2 identical Excel files with exact same column names, etc ... but with different numbers of rows.

 

First file contains 3130 entries and second one 2926. What I want is to be able to extract the 204 missing lines to check their differences.

 

In my small brain, a simple transformer that checks missing entries between two columns and group them in an inspector should exists but it seems not to be complicated enough ... ;o)

 

Thanks for any help

Did you try the ChangeDetector?


Just tried but no way. I always get 3130 "deleted" and 2926 "inserted" whatever parameters I changed.


Just tried but no way. I always get 3130 "deleted" and 2926 "inserted" whatever parameters I changed.

Can you share a screen dump of ChangeDetector configuration.


Here it is.


Here it is.

The problem is that the primary keys aren't called the same in both tables. As it is currently configured, the ChangeDetector will try to match based on the combination of both ID and Fuld_ID on both tables. Since one table only has ID and the other only has Fuld_ID, there are no matches.

Try renaming "Fuld_ID" to just "ID" on the "Data_Final" input before the ChangeDetector. You should then remove "Fuld_ID" from the ChangeDetector settings.


It's working, thanks a lot for your help !

 

My bad, I changed the field's name to ease selection from two different sets but didn't thought (nor understood from docs) an exact match was needed for the transformer to work.

 


Reply