Skip to main content
Solved

Join on a key which is not 100% equal

  • February 22, 2021
  • 1 reply
  • 74 views

lambertus
Enthusiast
Forum|alt.badge.img+24

Hi,

 

I have two datsets which I want to combine using a key. One dataset contains busisness names of companies and the other dataset contains more or less the samen names but there are smalll differences in some cases. I want to join both datsets on the keys which are most equal to each other.

 

For example:

 

Dataset A:

Amsterdam Cars B.V.

Rotterdam Cars Deluxe

 

Dataset B:

Amsterdam Cars

Rotterdam Deluxe Cars

 

Any ideas how to solve this issue? Maybe something with the FuzzyStringComparer?

Best answer by redgeographics

The FuzzyStringCompareFrom2Datasets transformer from the Hub sounds like it'll do what you want to do, but I'd highly recommend doublechecking the results. There's so much that can go wrong with fuzzy string matching 😅

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

1 reply

redgeographics
Celebrity
Forum|alt.badge.img+62
  • Celebrity
  • Best Answer
  • February 22, 2021

The FuzzyStringCompareFrom2Datasets transformer from the Hub sounds like it'll do what you want to do, but I'd highly recommend doublechecking the results. There's so much that can go wrong with fuzzy string matching 😅