Solved

Trouble reading a CSV file


aguan
  • Contributor

I have a CSV file with 259200 rows. The first column, CELL_ID, contains consecutive integers from 1 to 259200. When I read this file with the CSV reader, that column's data type is automatically set to uint16, which caps its maximum value at 65535, so every row after 65535 is read as NULL. The data type can't be changed. I've noticed other issues too: one column starts with many rows whose value is 0, so the CSV reader assigns it uint8, but later rows in that column hold doubles, and those also end up read as null. It would be nice if all columns could be read as strings so nothing is lost. I'd say the default settings are wrong in most cases. I've used the CSV reader for years, and this is the first time I've noticed the bad defaults.
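The failure mode described above can be sketched in a few lines of Python. This is not FME's actual code, just a hypothetical illustration (the function name and the 10000-row sample size are my own assumptions) of why inferring a column type from only the first rows truncates an ordered ID column:

```python
# Sketch (NOT FME's implementation): schema scanning that only samples
# the first N rows picks too narrow a type for an ordered ID column.

def infer_uint_type(sample):
    """Pick the smallest unsigned integer type that fits the sampled values."""
    hi = max(sample)
    if hi <= 2**8 - 1:
        return "uint8"
    if hi <= 2**16 - 1:
        return "uint16"
    if hi <= 2**32 - 1:
        return "uint32"
    return "uint64"

cell_ids = list(range(1, 259201))   # CELL_ID: 1 .. 259200, as in the file
sample = cell_ids[:10000]           # scanner only looks at early rows

dtype = infer_uint_type(sample)
print(dtype)                        # uint16 -- largest sampled value is only 10000

# Every value above 65535 no longer fits the inferred type, hence the NULLs.
overflow = sum(1 for v in cell_ids if v > 2**16 - 1)
print(overflow)                     # 193665 rows lost
```

The same logic explains the uint8 column: a long run of leading zeros fits the smallest type, and the doubles that appear later don't.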

Best answer by fhilding


2 replies

fhilding
  • Best Answer
  • March 24, 2021

Hi! I also run into this with files where the values are sorted. Two things you can do to fix it:

1) After you've selected the file in the CSV reader, click Parameters to see more about the feature type you're about to read. There (the first red circle) you can set it to scan more rows when assessing the schema.

2) You can also switch the attribute definition to Manual and set the type to the uint64 it almost always is for numeric ID columns.

Sometimes I wish the schema scanner checked the first X rows as well as the _last_ row, just to catch the "oh, this looks like an ordered file" case. But until FME magically does that for me, I'll have to remind myself to use one of those two options.
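The "first X rows plus the last row" heuristic wished for above is easy to sketch. This is a hypothetical illustration, not an FME feature; the function name and `head_n` default are assumptions:

```python
# Sketch (hypothetical, not an FME feature): sampling the last row as well
# as the first N rows catches ordered ID columns like CELL_ID.

def infer_with_tail(values, head_n=1000):
    """Infer the smallest unsigned int type from the head of the data
    plus the final row, so sorted columns aren't under-typed."""
    sample = values[:head_n] + values[-1:]   # first N rows + the last row
    hi = max(sample)
    for name, limit in [("uint8", 2**8 - 1),
                        ("uint16", 2**16 - 1),
                        ("uint32", 2**32 - 1)]:
        if hi <= limit:
            return name
    return "uint64"

cell_ids = list(range(1, 259201))
print(infer_with_tail(cell_ids))   # uint32 -- 259200 fits, no rows truncated
```

One extra sampled row is enough here because a sorted ID column puts its maximum at the end; it wouldn't help an unsorted column whose extreme value sits in the middle.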


aguan
  • Author
  • Contributor
  • March 24, 2021

@fhilding, thanks. I also found that the data types can be changed via the Manual setting in the Parameters when setting up the reader. All columns are read correctly now.



