String Manipulation with RegEx

Question

Hi FME'ers

I want to be able to extract the organisation name out of a very messy string. I have currently applied around 20 rules, but there is one rule/concept that I just can't seem to work out.

Example String: The Random Organisation Random Street Randomshire RR20 2RR

I have another dataset that contains a list of all streets, i.e. 'Random Street' will be a record in this 'Street' dataset.

How do I get FME to look through the street dataset, find the street name in the string, and strip it along with everything that follows it, i.e. Random Street Randomshire RR20 2RR?

Thanks!

david_r · Accepted Answer

Hi,

I would first try to match the organisations to their respective street names. I would do this in the database rather than in FME, using something like this (untested):

    select *
    from organisation, address
    where organisation.name like '%' || address.street || '%'

You could run this with a SQLCreator, for instance. The result should be one row for each organisation with its matching street name, with columns such as:

    org_name
    street_name

It would then be a simple matter of using a StringSearcher with a regexp that returns the part of org_name that precedes street_name.

David
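For readers who want to try the idea outside FME first, here is a minimal Python sketch of the same two steps: find a known street name inside the messy string, then keep only what comes before it. The function name `strip_from_street` and the sample street list are hypothetical, and the sketch assumes the street names appear in the string spelled as they are in the street dataset (ignoring case).

```python
import re


def strip_from_street(text, streets):
    """Return the part of `text` that precedes the first known street name.

    `streets` stands in for the 'Street' dataset (an assumption for this
    sketch). If no street name is found, `text` is returned unchanged.
    """
    candidates = [s for s in streets if s]
    if not candidates:
        return text

    # Longest names first, so that when two names start at the same position
    # the more specific one is preferred.
    candidates.sort(key=len, reverse=True)

    # Escape each name so characters such as '.' are matched literally, and
    # require that the match is not embedded inside a longer word.
    pattern = re.compile(
        r"(?<!\w)(?:" + "|".join(re.escape(s) for s in candidates) + r")(?!\w)",
        re.IGNORECASE,
    )

    match = pattern.search(text)
    if match is None:
        return text

    # Keep everything before the street name and drop the trailing space.
    return text[: match.start()].rstrip()


example = "The Random Organisation Random Street Randomshire RR20 2RR"
print(strip_from_street(example, ["High Street", "Random Street"]))
```

With the example string from the question, this prints `The Random Organisation`. For a very large street list, a single alternation like this may become slow, in which case doing the match in the database as suggested above is likely the better route.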

takashi · Answer

Hi Kam,

If the string always consists of:

organization name (one or more of any character)
<space>
street name (one or more characters, excluding spaces) <space> 'Street'
<space>
shire name (one or more characters, excluding spaces)
<space>
lot number (one or more of any character)

then a StringSearcher with the following expression would extract the elements:

^(.+)\s([^\s]+\sStreet)\s([^\s]+)\s(.+)$
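To check the expression outside FME, the following Python sketch applies the pattern as I read it, `^(.+)\s([^\s]+\sStreet)\s([^\s]+)\s(.+)$`, to the example string from the question. The function name `split_address` and the dictionary keys are hypothetical labels chosen for illustration; Python's `re` module is used here only as a stand-in for the StringSearcher, whose regex engine may differ in details.

```python
import re

# Four capture groups: organisation, street, shire, and the remainder.
PATTERN = re.compile(r"^(.+)\s([^\s]+\sStreet)\s([^\s]+)\s(.+)$")


def split_address(text):
    """Split `text` into its four parts, or return None if it does not fit."""
    match = PATTERN.match(text)
    if match is None:
        return None
    organisation, street, shire, remainder = match.groups()
    return {
        "organisation": organisation,
        "street": street,
        "shire": shire,
        "remainder": remainder,
    }


example = "The Random Organisation Random Street Randomshire RR20 2RR"
print(split_address(example))
```

For the example string this yields the organisation `The Random Organisation`, the street `Random Street`, the shire `Randomshire`, and the remainder `RR20 2RR`. Note that the pattern only matches when the street name is a single word followed by the literal word 'Street', so strings with 'Road', 'Lane', or multi-word street names return None.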

Takashi
