
Hello there, I have 20K features. Each feature has an attribute containing long text, and each text has a different word count. We need to compare the features with each other and extract the ones whose text is similar.

Is there any way to do this with FME?

Thanks

Hi @boubcher, how are you?

 

Did you try the Fuzzy transformer?

 

Thanks in Advance,



@danilo_fme​ 

Thanks for your reply.

I think the FuzzyStringCompareFrom2Datasets transformer will work.

I configured the inputs for the two datasets to compare. What about the ID Feature Type parameter? Do you have any idea what it is there for?

Thanks




Hi @boubcher, do you have unique feature IDs in your two datasets? If not, you can create them with the UUIDGenerator, or simply use the Counter transformer to assign a number to each feature.
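
If it helps to see the idea outside of FME, here is a minimal Python sketch of the two ID strategies. It is only an illustration, not what the UUIDGenerator or Counter transformers do internally. The function names, the `_id` attribute name, and the dict-based features are all hypothetical.

```python
import uuid
from itertools import count


def assign_counter_ids(features, start=0, attr="_id"):
    """Return copies of the features with a sequential integer ID added."""
    counter = count(start)
    return [{**feature, attr: next(counter)} for feature in features]


def assign_uuid_ids(features, attr="_id"):
    """Return copies of the features with a random UUID string added."""
    return [{**feature, attr: str(uuid.uuid4())} for feature in features]


# Hypothetical sample features; real features would come from your datasets.
features = [
    {"text": "road closed for repairs"},
    {"text": "bridge inspection due"},
]
numbered = assign_counter_ids(features)
tagged = assign_uuid_ids(features)
```

Either kind of ID works for matching, as long as it is unique within each dataset.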

 

This transformer compares the strings of features whose IDs match. If you would like to compare the strings against each other regardless of ID, I would suggest combining your two datasets with an Aggregator and using the FuzzyStringComparer.
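
To make the ID-independent comparison concrete, here is a rough sketch in plain Python. It uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure, which is an assumption on my part and not necessarily the algorithm FuzzyStringComparer uses. The `similar_pairs` helper, the 0.8 threshold, and the sample texts are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar_pairs(texts_by_id, threshold=0.8):
    """Compare every text with every other text and keep the similar pairs.

    texts_by_id maps a unique feature ID to its text attribute.
    Returns a list of (id_a, id_b, ratio) tuples with ratio >= threshold.
    """
    pairs = []
    for (id_a, text_a), (id_b, text_b) in combinations(texts_by_id.items(), 2):
        # autojunk=False keeps long texts from having frequent characters
        # treated as junk, which would distort the ratio.
        matcher = SequenceMatcher(None, text_a.lower(), text_b.lower(), autojunk=False)
        ratio = matcher.ratio()
        if ratio >= threshold:
            pairs.append((id_a, id_b, round(ratio, 3)))
    return pairs


# Hypothetical sample data keyed by feature ID.
sample = {
    1: "The road is closed for repairs until Friday.",
    2: "The road is closed for repair until Friday",
    3: "Bridge inspection scheduled for next month.",
}
result = similar_pairs(sample)
```

Keep in mind that comparing 20K features against each other means roughly 200 million pairs, so an all-pairs comparison like this can be slow on long texts. Grouping or pre-filtering the features first may be worth considering.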

 

Hope this helps! Evie

