Question

Defining columns in online text documents

1 year ago
July 21, 2023
7 replies
71 views

smithgk
Contributor
8 replies

Hello FME Experts that are far smarter than me,

I have a very interesting text file that I need to isolate columns for to map points via lat and long values. Below is the URL to the dataset, it relates to fire weather conditions that are updated daily.

https://www.wfas.net/images/firedanger/fdr_obs.txt

I need to isolate the Colorado-related records and put all the values into their designated columns. If I use the standard text file reader, everything is mashed into one column (see screenshot). I tried using the CAT Reader, but can't seem to get that option to the right place.

Does anyone know a better Reader or combination of Transformers that could help delineate these columns of data?

+21

kailinatsafe
Safer
717 replies
1 year ago
July 25, 2023

Hello @smithgk, hmm.. unfortunately, I haven't been able to find a way to isolate/extract just Colorado records from the entire dataset😔 I think subsetting the Colorado records from the entire file will require a lot of AttributeSplitters/ListExploding/joining/etc - not impossible though!

If you're able to try copying/pasting the Colorado records (or records of interest) to a seperate text file, the CAT Reader works flawlessly!

Hope this helps, Kailin.

+51

geomancer
Evangelist
889 replies
1 year ago
July 26, 2023

It is not too difficult to extract the Colorado data. Read the entire text file at once. Use a StringSearcher with this regular expression (all * need to be escaped) to extract all Colorado data:

.+(\*\*\*\*\* Colorado \*\*\*\*\*[^\*]+)

Use the Subexpression Matches List Name to save the results in a list. Write the results to a text file.

Colorado

+51

geomancer
Evangelist
889 replies
1 year ago
July 26, 2023

Or continue processing the resulting data, probably starting with an AttributeSplitter on <Newline \\n>, and a ListExploder. Something like this:

Colorado2

+21

kailinatsafe
Safer
717 replies
1 year ago
July 26, 2023

Hello @smithgk, jumping back in the conversation here - hope you don't mind! Great idea with the StringSearcher @geomancer! After another attempt, managed to extract the state information using the Adjacent Features function! Here is another potential solution for you!

After reading the text file, the AttributeCreator is getting the state name (using regex) from any lines that start with ***** <text> ***** and assigning any subsequent features (that are presumed to be data) the value of state from the previous feature. An Aggregator is used to combine back into a single attribute, and then temporarily written as a text file. Because the file will not be created until runtime, you can point the FeatureReader at a similar dataset with the same schema/formatting (in my case, I used the colorado.txt file I show'd in the first comment). Let me know if you have any questions at all or have issues running the workspace. Happy to help, Kailin.

smithgk
Author
Contributor
8 replies
1 year ago
July 26, 2023

geomancer wrote:

Or continue processing the resulting data, probably starting with an AttributeSplitter on <Newline \\n>, and a ListExploder. Something like this:

Colorado2

Thanks geomancer! This is a good idea, I'll see about inputting into the workflow. I appreciate it!

smithgk
Author
Contributor
8 replies
1 year ago
July 26, 2023

kailinatsafe wrote:

Thank you Kailin! This looks like a nice and agile workflow that I'll definitely try out. Over the last few days I did come up with a solution too using the AttributeSplitter transformer, though it did take some trial and error. I first read the text file into Excel and used the Text to Columns function. I used the 'fixed width' option and wrote down the literal individual spaces that separate the attributes. Everything displayed in Excel perfect using that method.

I then went to FME, and read the text data using a Text Reader; I used the 'Number of Lines to Skip' and 'Number of Footer Lines to Skip' parameters to just grab the Colorado records. Using the AttributeSplitter transformer, I then input the exact amount of spaces that separate attributes using the 'Format String' option and format (see screenshot). The transformer split the data out into list values, which I then used an AttributeManager transformer to rename.

Capture I know this workflow may not be as agile and flexible as previous solutions, but it does get the data to where it needs to be since the format/layout of the text file will never change (column values will only change). That said, I'll definitely be trying out the other workflows to replicate the process and see if I can get something that's more adaptable to possible future database changes. Thank you Kailin!

+51

geomancer
Evangelist
889 replies
1 year ago
July 27, 2023

geomancer wrote:

Or continue processing the resulting data, probably starting with an AttributeSplitter on <Newline \n>, and a ListExploder. Something like this:

Colorado2

Hi, I see I forgot to add the workspace. I've attached it below.

The AttributeManager reads all the different attributes, with Attribute Values like

@Trim(@Substring(@Value(Data),71,6))

Reply

Rich Text Editor, editor1

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

Defining columns in online text documents