Solved

Write outputs before translation is complete

9 years ago
April 4, 2016
4 replies
45 views

jamiefinney
22 replies

How would I alter a workspace to export files as they are finished, rather than holding them all in memory / temp files until everything is processed?

The input is about 200 GB of MapInfo files, so this is taxing my system quite a bit. FME seems not to write anything until it's completely gone through all the files.

I have a FeatureReader that processes all the *.tab files in a folder and clips them where they intersect an input polygon. It then saves the files into an identical folder / name structure.

FME 2016 ESRI Edition, 16 GB RAM, i7-4790K


redgeographics
Celebrity
3643 replies
9 years ago
April 4, 2016

You could use a master/child approach. A master workspace with a "Directory and File Pathnames" reader calls a child workspace using a WorkspaceRunner. That child workspace is called once per feature, i.e. once for every .tab file, with that filename as a parameter. The child workspace has a MapInfo reader and does the actual work. That way you don't have to read all of the 200 GB into memory at the same time.
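For anyone who prefers to see the idea outside of Workbench, here is a rough Python sketch of the same master/child loop: find every .tab file, then launch one child translation per file and wait for it to finish. It assumes the `fme` command-line executable is on the PATH and that the child workspace exposes a published parameter for its source dataset. The names `child.fmw`, `SourceDataset`, `build_child_commands` and `run_sequentially` are placeholders for illustration, not something taken from the attached workspaces.

```python
import subprocess
from pathlib import Path


def build_child_commands(source_dir, child_workspace="child.fmw",
                         fme_exe="fme", param_name="SourceDataset"):
    """Return one command line per .tab file found under source_dir.

    child_workspace, fme_exe and param_name are assumptions: replace
    them with your own workspace path, FME executable and published
    parameter name.
    """
    tab_files = sorted(Path(source_dir).rglob("*.tab"))
    return [
        [fme_exe, child_workspace, f"--{param_name}", str(tab)]
        for tab in tab_files
    ]


def run_sequentially(commands, runner=subprocess.run):
    """Run each child translation and wait for it before starting the next."""
    # check=True makes subprocess.run raise if a child exits with an error.
    return [runner(cmd, check=True) for cmd in commands]
```

Calling `run_sequentially(build_child_commands(r"C:\data\mapinfo"))` (a made-up path) would process one file at a time, so each output is written as soon as its own child process finishes.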

+19

fmelizard
Safer
3725 replies
9 years ago
April 4, 2016

I was going to suggest using a FeatureWriter, but upon testing I believe it exhibits the same behaviour -- in a dataset fanout case, we only have 1 actual writer going at a time. For now, the best option does seem to be the approach from @redgeographics above. Sorry. We'll keep working...

jamiefinney
Author
22 replies
9 years ago
April 4, 2016

@redgeographics @daleatsafe

I've tried to get this working using a WorkspaceRunner and I can't quite make it work.

The inputs are a MapInfo reader (just a polygon with my region of interest) and, more importantly, the FeatureReader that contains the path and wildcard to parse through all the .tab files in the directory.

When I try to use a WorkspaceRunner with the Directory and File Pathnames reader, I think it's only picking up the variable (windows_path) for the input clip region. As soon as I run it, it says complete but doesn't seem to actually do anything.

+50

redgeographics
Celebrity
3643 replies
Best Answer
9 years ago
April 5, 2016

Here's what I came up with, using the FME sample data which has a collection of shapefiles with contours of Vancouver, that I'm clipping using a neighborhood boundary. So different formats and a lot less data but in broad terms it's the same as what you're trying to do.

All of the reading happens in the child workspace; the master workspace is just there to start the child workspace once per input file. Keep in mind that I've set the WorkspaceRunner to wait for the current job (child process) to complete before starting a new one. You can try running multiple jobs simultaneously to save time, but you may run into memory problems that way.

master.fmw

child.fmw
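The trade-off between waiting for each job and running several at once can be sketched in Python as a bounded pool of child processes. This is only an illustration of the idea, not how the WorkspaceRunner is implemented. The function name `run_children` and the `max_jobs` parameter are made up for the example, and each command is assumed to be a list that launches one child translation.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def run_children(commands, max_jobs=1, runner=subprocess.run):
    """Run child commands with at most max_jobs active at the same time.

    max_jobs=1 mirrors the wait-for-completion setting described above;
    larger values save wall-clock time but raise peak memory use.
    """
    if max_jobs < 1:
        raise ValueError("max_jobs must be at least 1")
    with ThreadPoolExecutor(max_workers=max_jobs) as pool:
        # pool.map yields results in the same order as the input commands.
        return list(pool.map(lambda cmd: runner(cmd, check=True), commands))
```

Starting with `max_jobs=1` and raising it only while memory use stays comfortable is probably the safest way to tune this for a 16 GB machine.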
