Question

Compare Text to each other

  • August 12, 2022
  • 3 replies
  • 192 views

boubcher
Contributor

Hello there, I have 20K features. Each feature has an attribute with long text content, and each text has a different word count. We need to compare the features to each other and extract the ones whose text is similar.

Is there any way to do this with FME?

Thanks

3 replies

danilo_fme
Celebrity
  • Celebrity
  • August 12, 2022

Hi @boubcher​ how are you?

 

Did you try the Fuzzy transformer?

 

Thanks in Advance,


boubcher
Contributor
  • Author
  • Contributor
  • August 13, 2022

@danilo_fme​ 

Thanks for your reply.

I think the FuzzyStringCompareFrom2Datasets transformer will work.

I configured the inputs for the two datasets to compare. What about the ID Feature Type parameter? Do you have any idea what it is there for?

Thanks


evieatsafe
Safer
  • Safer
  • August 16, 2022

Hi @boubcher, do you have unique feature IDs in your two datasets? If not, you can create them with the UUIDGenerator, or simply use the Counter transformer to assign a number to each feature.
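
As a rough illustration of what those two options amount to outside FME, here is a minimal Python sketch. Plain dictionaries stand in for features, and the `text`, `id` and `uuid` names are made up for this example; they are not FME attribute names.

```python
import uuid


def assign_ids(records, start=1):
    """Return copies of the records with a sequential 'id' and a random 'uuid' added."""
    out = []
    for n, rec in enumerate(records, start=start):
        new = dict(rec)                  # copy so the input records stay untouched
        new["id"] = n                    # Counter-style sequential number
        new["uuid"] = str(uuid.uuid4())  # UUID-style unique identifier
        out.append(new)
    return out


features = [{"text": "first long text"}, {"text": "second long text"}]
with_ids = assign_ids(features)
```

Either kind of ID works for matching, as long as the same feature carries the same ID in both datasets.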

 

This transformer will test the strings of each feature that matches the ID provided. If you would like to test the fuzzy strings against each other (not dependent on IDs), I would suggest combining your two datasets with an Aggregator and using the FuzzyStringComparer.
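
To make the difference between the two approaches concrete, here is a small Python sketch that uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure. It is not what the FME transformers do internally; the function names and the 0.8 threshold are assumptions for illustration only.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1]; 1.0 means identical strings."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def compare_by_id(dataset_a, dataset_b):
    """Compare only texts that share an ID in both datasets (dicts of id -> text)."""
    shared = dataset_a.keys() & dataset_b.keys()
    return {i: similarity(dataset_a[i], dataset_b[i]) for i in shared}


def compare_all_pairs(texts, threshold=0.8):
    """Compare every text with every other one; keep pairs at or above the threshold."""
    matches = []
    for (id1, t1), (id2, t2) in combinations(texts.items(), 2):
        score = similarity(t1, t2)
        if score >= threshold:
            matches.append((id1, id2, score))
    return matches


texts = {
    1: "The quick brown fox jumps over the lazy dog",
    2: "The quick brown fox jumped over the lazy dog",
    3: "Completely unrelated sentence about roads",
}
similar = compare_all_pairs(texts)  # only the pair (1, 2) clears the threshold
```

Note that comparing 20K features all against each other means roughly 200 million pairs, so some pre-filtering or grouping before the comparison would likely be needed.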

 

Hope this helps! Evie