Solved

Join on a key which is not 100% equal

  • 22 February 2021
  • 1 reply
  • 8 views

Badge +4

Hi,

 

I have two datsets which I want to combine using a key. One dataset contains busisness names of companies and the other dataset contains more or less the samen names but there are smalll differences in some cases. I want to join both datsets on the keys which are most equal to each other.

 

For example:

 

Dataset A:

Amsterdam Cars B.V.

Rotterdam Cars Deluxe

 

Dataset B:

Amsterdam Cars

Rotterdam Deluxe Cars

 

Any ideas how to solve this issue? Maybe something with the FuzzyStringComparer?

icon

Best answer by redgeographics 22 February 2021, 11:22

View original

1 reply

Userlevel 4
Badge +25

The FuzzyStringCompareFrom2Datasets transformer from the Hub sounds like it'll do what you want to do, but I'd highly recommend doublechecking the results. There's so much that can go wrong with fuzzy string matching 😅

Reply