Question

How to remove the repeated words in FME

9 months ago
September 4, 2024
5 replies
171 views

chaoluo
Contributor
5 replies

Hi All,

Anyone could help me with this specific question in FME? I want to remove the repeated words in each row, e.g. A,B,C,A,B in a row, the A, B have been repeated twice, I would like to remove the repeated A and B only. please see attached the sample data. many thanks.

+54

nielsgerrits
2842 replies
9 months ago
September 4, 2024

One way to do this:

Create a unique ID for each row. (Counter)
Create a list for all PlayName elements. (AttributeSplitter)
Explode list to features (ListExploder)
Clean up unique values per ID (Sampler, group by ID).
Merge all rows back into one (Aggregator, group by ID, merge attributes)
Merge rows back to originals. (FeatureMerger, merge by ID)

+45

danilo_fme
Evangelist
2057 replies
9 months ago
September 4, 2024

Hi @chaoluo

You can use this logic:

+50

geomancer
Evangelist
884 replies
9 months ago
September 5, 2024

Use a list and some list manipulation transformers:

Use an AttributeSplitter to create a list from the PlayName elements
Use a ListDuplicateRemover to remove the duplicate elements from the list
Use a ListConcatenator to fill an attribute with the remaining elements from the list

1 Attachments

Remove_duplicate_values_from_attribute.zip

+26

bwn
Evangelist
562 replies
9 months ago
September 5, 2024

@chaoluo Similarly method would use identical to that proposed by @nielsgerrits , @danilo_fme and @geomancer

The only difference between the methods is whether to use ListExploder + DuplicateFilter after the AttributeSpliiter, or instead just use ListDuplicateRemover after the AttributeSplitter.

I’ve used both approaches, and it depends on how big the Lists become over how many Features. ListExploder + DuplicateFilter , despite needing more Transformers, can execute faster, as ListDuplicateRemover can be slow to traverse all the List Attributes and it has an overhead in having to rename whatever List Attributes are left after the duplicate all to new List index numbers.

So small->medium number of features and not a lot duplicates, ListDuplicateRemover approach above works fine as general approach, but if it performs slowly can look to trial the ListExploder + DuplicateFilter method instead.

The only extra tip is to think about an extra Sorter before the Aggregator (@nielsgerrits method) or ListSorter before the ListConcantenator ( @danilo_fme , @geomancer ) to get the values alphabetically sorted, comma-delimited in the final output. I do this a lot to reduce the amount of random ordering that flows into say a ChangeDetector where if I didn’t sort the list first before comma-delimiting it, ChangeDetector would keep flagging a record had “changed”, where the next run of the workspace slightly change order of values, but where otherwise the same text strings and would cause the write to database to have an excess number of updates needed only because of the sometimes randomness of the order in fields with Eg. comma-delimited values.

+11

ronnie.utter
Contributor
37 replies
9 months ago
September 5, 2024

@chaoluo . This workspace should work

its converts to lowercase while checking if its combined in the source and also check the usual used delim. But it does not choose if you want to keep for example Jurassic or jurassic if booth are in the string. So thats needs to bee tweeked in the flow or change in the source.
But this is something to start with.
I have no idea what HPNT or NPNT is so this combination in this example is probably wrong: )

after

1 Attachments

remove_repetable_values.zip

Reply

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

How to remove the repeated words in FME

5 replies

1 Attachments

1 Attachments

Reply

Helpful Members This Week

Recently Solved Questions

"Bulk" Concatenate Attributes

Use attribute in attribute creator adjacent fields settings

How do I stop ChangeDetector running if no features going into Revised?

pivot table long to wide not working with AttributePivoter

How to get the path/location of a transformer like the log files do?

Community Stats

Latest FME

Cookie policy

Cookie settings

1 Attachments

1 Attachments

Reply

Related Topics

How to implement 2 'annual' subscription packages? (SwiftUI)icon

Multiple offerings vs custom identifiersicon

Multi-quantity subscriptions and assigning subscriptions to other usersicon

Swapping Subscriptionsicon

Contribute to the Flutter SDK

Helpful Members This Week

Recently Solved Questions

"Bulk" Concatenate Attributes

Use attribute in attribute creator adjacent fields settings

How do I stop ChangeDetector running if no features going into Revised?

pivot table long to wide not working with AttributePivoter

How to get the path/location of a transformer like the log files do?

Popular Tags

Community Stats

Latest FME

Sign up

An FME Account is required to contribute

Login to the community

An FME Account is required to contribute

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings