Parse CSV data stored in database table

Question

Hi FME'ers, I have CSV data that is stored in a table in Oracle. The data is created by a stored procedure and is stored in a single field, with one row per CSV line. I would like to read and parse this data into FME as if it were a CSV file, for further processing. I figured I would start by aggregating the field (attribute) into a single row, but I don't know where to go from there. I can't use the AttributeSplitter because the data uses " qualifiers around values that contain commas. Many thanks, David

david_r · Accepted Answer

I'm thinking that a PythonCaller with the CSV module is perfect for this, it already has all the necessary mechanisms for dealing with lots of edge cases like quotations, newlines, etc. Try something like:

    from fmeobjects import *
    import csv

    class ParseCSVString(object):
        def input(self, feature):
            csv_string = [feature.getAttribute('CSV_LINE') or '']
            csv_parser = csv.reader(csv_string)
            for record in csv_parser:
                f = feature.clone()
                for n, value in enumerate(record):
                    f.setAttribute('value{%s}' % n, value)
                self.pyoutput(f)

Expose the attribute list "value{}" in the PythonCaller.

csv-line.fmwt
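To try the csv module outside FME first, here is a minimal standalone sketch of the same parsing step. The function name `parse_csv_line` and the sample line are made up for illustration; only the standard-library `csv.reader` call is taken from the answer above.

```python
import csv

def parse_csv_line(line):
    """Parse one CSV-formatted string into a list of field values.

    Hypothetical helper for illustration; not part of FME.
    """
    # csv.reader accepts any iterable of strings, so wrap the line in a list.
    for record in csv.reader([line or '']):
        return record
    return []  # nothing was parsed

# Made-up sample: a quoted field containing a comma, and an escaped quote.
sample = '1,"Smith, John","He said ""hi""",42'
print(parse_csv_line(sample))
# ['1', 'Smith, John', 'He said "hi"', '42']
```

The comma inside the qualified value stays within one field, which is the case a plain split on commas gets wrong.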

gio · Answer

You can simply read this field. You would then get an attribute with the CSV in it.

Split it by newline (Text Editor -> Special Characters -> newline), then explode the resulting list.

Connect a Data Inspector.

(If the field-name row uses different qualifiers, separate it out by selecting _element_index = 0 so it can be treated separately. Otherwise there is no need.)

Export the field-name list to a text file by copying it from the Feature Information window in the Data Inspector: select the list{} attribute names and attribute values, use "copy text with indentation", and save the result as a .txt file.

If no attribute value contains a comma, split by comma. Otherwise, first use a StringReplacer to replace the commas that separate the fields with, for instance, a backslash (again via the Text Editor, or by copying a backslash from another file). Then use a second StringReplacer to remove the qualifiers. Use a regular expression so that only the first and last ones are removed (so as not to remove a ' that is not a qualifier): ^'|'$

Now you can split by backslash (or whatever character you chose).

Use an AttributeRenamer to rename the _list{} attributes using the import function (an addition in recent FME versions that I am very happy about).
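The replace, strip and split steps above can be sketched in plain Python to check the logic before building it with transformers. This is one reading of the steps, not the exact workspace: it assumes every field is wrapped in the qualifier, so that the separator between fields is qualifier, comma, qualifier, and that the data contains neither that sequence nor the placeholder character. The function name and sample line are hypothetical.

```python
import re

def split_qualified_line(line, qualifier="'", placeholder="\\"):
    """Split a line in which every field is wrapped in `qualifier`.

    Hypothetical helper mirroring the StringReplacer / AttributeSplitter steps.
    """
    # Step 1: replace the separator between fields (',' with the default
    # qualifier) by a placeholder that does not occur in the data.
    joined = line.replace(qualifier + "," + qualifier, placeholder)
    # Step 2: remove only the first and last qualifier, i.e. the regex ^'|'$
    q = re.escape(qualifier)
    stripped = re.sub("^%s|%s$" % (q, q), "", joined)
    # Step 3: split on the placeholder.
    return stripped.split(placeholder)

# Made-up sample: one value contains a comma, another a non-qualifying '.
line = "'id','Smith, John','O'Brien'"
print(split_qualified_line(line))
# ['id', 'Smith, John', "O'Brien"]
```

Anchoring the regular expression to the start and end of the string is what keeps the apostrophe in O'Brien intact.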
