Skip to main content

Hi,

 

I have two datsets which I want to combine using a key. One dataset contains busisness names of companies and the other dataset contains more or less the samen names but there are smalll differences in some cases. I want to join both datsets on the keys which are most equal to each other.

 

For example:

 

Dataset A:

Amsterdam Cars B.V.

Rotterdam Cars Deluxe

 

Dataset B:

Amsterdam Cars

Rotterdam Deluxe Cars

 

Any ideas how to solve this issue? Maybe something with the FuzzyStringComparer?

The FuzzyStringCompareFrom2Datasets transformer from the Hub sounds like it'll do what you want to do, but I'd highly recommend doublechecking the results. There's so much that can go wrong with fuzzy string matching 😅


Reply