Solved

data examination

  • September 9, 2024
  • 2 replies
  • 77 views

nanchuangyeyu
Contributor

Hi everyone, I am doing a data quality inspection on an Esri personal geodatabase with multiple layers, and I have to count how many 0 or null values there are in every attribute of each layer (or table) in the database. I have tried to work out an automated method with FME, but in vain. Can anybody give some instruction or advice?

Best answer by bwn

I guess one general solution is to use NullAttributeMapper + ChangeDetector.

There are higher-performance solutions, such as coding a check into a PythonCaller that passes Features through and performs a per-Feature check with, e.g., the fmeobjects.FMEFeature.isAttributeNull() method, but I won’t go into that here.
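For anyone curious what such a per-feature check might look like, here is a minimal sketch. It relies only on the feature exposing getAllAttributeNames(), isAttributeNull() and getAttribute(), which FMEFeature does. The helper names `flag_null_and_zero` and `_is_zero`, the `StubFeature` class used to try the logic outside FME, and the choice to count the strings "0" and "0.0" as zeros are all assumptions made for illustration. Wiring this into a PythonCaller's input() method is left out.

```python
def _is_zero(value):
    """Assumption: numeric 0 and numeric-looking strings such as "0.0" count as zero."""
    if isinstance(value, bool):  # bool is a subclass of int, so exclude it explicitly
        return False
    if isinstance(value, (int, float)):
        return value == 0
    if isinstance(value, str):
        try:
            return float(value) == 0
        except ValueError:
            return False
    return False


def flag_null_and_zero(feature):
    """Return (null_names, zero_names): the attribute names that are null or zero."""
    null_names, zero_names = [], []
    # Note: on a real FMEFeature this list also contains format attributes
    # (fme_type and so on), which you would probably want to skip.
    for name in feature.getAllAttributeNames():
        if feature.isAttributeNull(name):
            null_names.append(name)
        elif _is_zero(feature.getAttribute(name)):
            zero_names.append(name)
    return null_names, zero_names


class StubFeature:
    """Hypothetical stand-in for an FMEFeature, only for trying the helper outside FME."""

    def __init__(self, attrs):
        self._attrs = dict(attrs)

    def getAllAttributeNames(self):
        return list(self._attrs)

    def isAttributeNull(self, name):
        return self._attrs[name] is None

    def getAttribute(self, name):
        return self._attrs[name]


sample = StubFeature({"name": None, "lanes": 0, "width": "0.0", "speed": 50})
null_names, zero_names = flag_null_and_zero(sample)
```

The two returned lists play the same role as the per-Feature lists of attribute names that the ChangeDetector approach builds.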


Starting with sample data.

 

Send these to NullAttributeMappers to overwrite the original Null and Zero values with the flagging text strings “NullValue” and “ZeroValue”. Then configure the ChangeDetector to build, per Feature, the lists of attribute names that had either a Zero or a Null.

 

Finally, send this to a StatisticsCalculator to get an overview of how many Nulls and Zeros there are per Feature Class/Table. Note that the “Group By Mode” and “Group By” parameters need to be set carefully.


This gives the summary count of Zeros and Nulls per Feature Class/Table name, per field name.
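To make the shape of that summary concrete, here is a rough plain-Python sketch of the same grouping, run outside FME. It assumes each feature has already been reduced to a (table name, null attribute names, zero attribute names) tuple. The function name `summarize` and the sample table and field names are invented for illustration and are not part of the workspace.

```python
from collections import Counter


def summarize(flagged):
    """Count nulls and zeros per (table, field).

    flagged: iterable of (table_name, null_names, zero_names) tuples, one per feature.
    Returns sorted rows of (table_name, field_name, null_count, zero_count).
    """
    nulls, zeros = Counter(), Counter()
    for table, null_names, zero_names in flagged:
        nulls.update((table, name) for name in null_names)
        zeros.update((table, name) for name in zero_names)
    # Only fields with at least one null or zero appear in the result.
    keys = sorted(set(nulls) | set(zeros))
    return [(table, field, nulls[(table, field)], zeros[(table, field)])
            for table, field in keys]


rows = summarize([
    ("roads", ["name"], ["lanes"]),
    ("roads", ["name"], []),
    ("parcels", [], ["area"]),
])
for row in rows:
    print(row)
```

Grouping on the (table, field) pair is the point here: grouping on the field name alone would merge same-named fields from different layers, which is presumably why the Group By settings need care.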

 


2 replies

bwn
Evangelist
  • Evangelist
  • Best Answer
  • September 10, 2024


nanchuangyeyu
Contributor
  • Author
  • Contributor
  • September 10, 2024
Thank you so much for your detailed instructions. That really helped me out.

