Solved

Using regex select a full single line of text before a match

5 years ago
June 4, 2020
5 replies
1339 views

+3

sfb
Contributor
7 replies

I have a block of text separated by newline characters, e.g:

Some Text

Some More Text

Even More Text

PERIOD: 01/01/1990 TO 12/12/2020

What I'm attempting to do using regex is grab the entire line of text preceding the row beginning with PERIOD (i.e. "Even More Text"). In an online regex editor, the following expression successfully returns just the line containing "Even More Text":

^.*$(?=\\nPERIOD)

However, when I attempt to do the same in FME, it returns all lines above PERIOD. It seems as though in online editors the . includes all characters except newlines, whereas in FME it includes them? Is there a way to adjust multiline regex flags (or some other workaround) in FME to get the desired output?

Best answer by david_r

Why not use the AttributeSplitter to split the block of text by line, then send it to the ListExploder to process each line at a time. You can then use e.g. the AttributeCreator and the Adjacent feature mode to retrieve the previous line from the one you're processing:

View original

Did this help you find an answer to your question?

+14

arnold_bijlsma
Enthusiast
123 replies
5 years ago
June 4, 2020

Quantifiers are by definition greedy, matching as much as possible. By putting a ? behind your asterisk, it makes the quantifier lazy, matching as little as possible.

I don't know if it'll work, but try

^.*?$(?=\nPERIOD)

david_r
8355 replies
Best Answer
5 years ago
June 4, 2020

Why not use the AttributeSplitter to split the block of text by line, then send it to the ListExploder to process each line at a time. You can then use e.g. the AttributeCreator and the Adjacent feature mode to retrieve the previous line from the one you're processing:

+39

ebygomm
Influencer
3313 replies
5 years ago
June 4, 2020

You could use the regex to match everything but a newline

^[^\n]*(?=\nPERIOD)

But I would probably go with adjacent attribute mapping as mentioned by @david_r

+3

sfb
Author
Contributor
7 replies
5 years ago
June 5, 2020

ebygomm wrote:

You could use the regex to match everything but a newline

^[^\n]*(?=\nPERIOD)

But I would probably go with adjacent attribute mapping as mentioned by @david_r

Thank you. This regex returns the correct line I was after.

+3

sfb
Author
Contributor
7 replies
5 years ago
June 5, 2020

david_r wrote:

Why not use the AttributeSplitter to split the block of text by line, then send it to the ListExploder to process each line at a time. You can then use e.g. the AttributeCreator and the Adjacent feature mode to retrieve the previous line from the one you're processing:

Thanks David. This is an elegant alternative to using regex. Though it requires a few more transformers it might be preferable doing it this way to make the workbenches more usable for work colleagues.

Reply

Rich Text Editor, editor1

Using regex select a full single line of text before a match

5 replies

Reply

Helpful Members This Week

Recently Solved Questions

Using one AttributeRounder for different accuracies

Create date segments of two table with overlap of times

Automate Fanout of columns/splitting attributes to different output by attribute name

Tracing Multiple Networks from Sources to Valves Without Python

FME Flow version control how to use different branch

Community Stats

Latest FME

Cookie policy

Cookie settings

Reply

Related Topics

Tiktok or IG with low views and poor captionicon

Helpful Members This Week

Recently Solved Questions

Popular Tags

Community Stats

Latest FME

Sign up

An FME Account is required to contribute

Login to the community

An FME Account is required to contribute

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings