Hello! I’m trying to figure out the best way to aggregate values by the most common words. I’ve found a few threads and documents but not quite what I was looking for.
For example, I have a list of building names and numbers where each entry can have a of variation of a building name and number:
“1000 The Coolest Building Ever”
“1000 Coolest Building”
“Coolest Building”
“100 Coolest Building Dr.”
I would like the output to be “Coolest Building”, as it has common base words across all features. Is this possible?
Bonus point if a variation of “Bldg.” “Bldg” can be included. Any advice/guidance is appreciated!