Question

How many of us testing data in FME?

2 years ago
November 10, 2022
4 replies
34 views

+1

michpil
Contributor
9 replies

This is opened question to FME Community. FME workspaces are very individual. But I believe there is a way quite universal way to test it (FME Testing Framework, rTest, DatasetValidator, Test Custom Transformer). This topic is about data quality. It is crucial!. How do you test your data in FME?

+50

redgeographics
Celebrity
3643 replies
2 years ago
November 14, 2022

As you say, it is very individual. In broad terms, I would first check data structure (i.e. the schema), then content.

The one thing that I think is hard to automate is the "sanity check" on the data. Taking a look at the volume and considering whether or not that is an expected amount. If you often process data for local governments you kinda get an idea of the relationship between for example the number of inhabitants vs the number of addresses. I.e. if I process data for a municipality with 100.000 inhabitants but I only have 5.000 addresses I can quite confidently say my data is incomplete.

+1

michpil
Author
Contributor
9 replies
2 years ago
November 16, 2022

redgeographics wrote:

As you say, it is very individual. In broad terms, I would first check data structure (i.e. the schema), then content.

The one thing that I think is hard to automate is the "sanity check" on the data. Taking a look at the volume and considering whether or not that is an expected amount. If you often process data for local governments you kinda get an idea of the relationship between for example the number of inhabitants vs the number of addresses. I.e. if I process data for a municipality with 100.000 inhabitants but I only have 5.000 addresses I can quite confidently say my data is incomplete.

Okay,

How do you testing schema and content? Manually?
How do you prepare expected data based on requirements?
How do you reporting it? I mean, do you have test cases with execution results passed/failed?

+50

redgeographics
Celebrity
3643 replies
2 years ago
November 17, 2022

michpil wrote:

Okay,

How do you testing schema and content? Manually?
How do you prepare expected data based on requirements?
How do you reporting it? I mean, do you have test cases with execution results passed/failed?

Re. #1: We've used the ChangeDetector on Schema features with some good results. Comparing a supplied schema with an expected schema and then reporting the differences.

+1

michpil
Author
Contributor
9 replies
2 years ago
November 17, 2022

michpil wrote:

Okay,

How do you testing schema and content? Manually?
How do you prepare expected data based on requirements?
How do you reporting it? I mean, do you have test cases with execution results passed/failed?

Okay, I had the same idea (below image). Now my FME Testing Framework is just FME Workbench which you can add (instead of ChangeDetector) with WorkspaceRunner parameters: actual in CSV, expected in CSV and in output you have Test Cases (based on CSV columns) Report in html (pytest) and xls. I plan to do it also for SHP.

Thanks

1 Attachments

Reply

Rich Text Editor, editor1

How many of us testing data in FME?

4 replies

1 Attachments

Reply

Helpful Members This Week

Recently Solved Questions

Read Settings from Delimited Text File

Generic source file name confusion? Or bad workflow?

Truncate SDE table with archiving enabled

Dissolver - Attributes to Sum and Multi Polygons:1+2 = 5

How to see which features have invalid source datasets when using a FeatureWrite?

Community Stats

Latest FME

Cookie policy

Cookie settings

1 Attachments

Reply

Related Topics

Combined TopoJSON file for multiple objects.icon

Bunch conversion from Text file to KMLicon

Blocks not written to Autocad drawingicon

Question of the Week: Multiple Reports with the HTMLReportGenerator Transformericon

Upload multiple files with the FME Server Javascript APIicon

Helpful Members This Week

Recently Solved Questions

Read Settings from Delimited Text File

Generic source file name confusion? Or bad workflow?

Truncate SDE table with archiving enabled

Dissolver - Attributes to Sum and Multi Polygons:1+2 = 5

How to see which features have invalid source datasets when using a FeatureWrite?

Popular Tags

Community Stats

Latest FME

Sign up

An FME Account is required to contribute

Login to the community

An FME Account is required to contribute

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings