I have several county-wide 4-band GeoTIFF imagery datasets that I need to review. Sometimes I find the exact same image in two different tiles. I would like to automate the detection of this error. Does anyone have a workflow for this, or any ideas on how to move forward?
Hi @joeyb2018
There are many transformers for raster files.
The RasterPropertyExtractor transformer can help you extract the properties of a raster file.
Afterwards, you can use the Tester transformer to compare that information between the source datasets.
Thanks,
Danilo
I think I should clarify the question a bit. In this case, the imagery is being applied to the wrong image tile. This often creates duplicate images within the county's imagery set. As such, I am not sure which raster properties would be useful for making a comparison. Is there a method that could compare histograms across the entire imagery dataset, assuming that each unique image has a unique histogram?
Thanks,
Joseph
I forgot to mention that the imagery also takes on the same name as the tile. So: same image, two tiles, two names.
Please see the custom transformer RasterStatisticCalculator.
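Outside FME, the histogram comparison asked about earlier could be sketched roughly as below. This is not how RasterStatisticCalculator works internally; it is a minimal illustration that assumes each tile's pixel values have already been read into a flat, band-interleaved list of integers (the reading step, the `tiles` dictionary, and the function names are all hypothetical). Note that two different images can share a histogram, for example when the same pixels are rearranged, so matching histograms only flag candidates for a closer check.

```python
from collections import Counter, defaultdict


def band_histograms(pixels, bands=4):
    """Return one histogram per band for band-interleaved pixel values.

    Assumption: `pixels` is laid out as b0, b1, b2, b3, b0, b1, ... so that
    every `bands`-th value belongs to the same band.
    """
    return tuple(
        tuple(sorted(Counter(pixels[b::bands]).items())) for b in range(bands)
    )


def group_by_histogram(tiles, bands=4):
    """Group tile names whose per-band histograms are identical."""
    groups = defaultdict(list)
    for name, pixels in tiles.items():
        groups[band_histograms(pixels, bands)].append(name)
    # Only groups with two or more tiles are duplicate candidates.
    return sorted(sorted(names) for names in groups.values() if len(names) > 1)


# Hypothetical two-pixel, 4-band tiles used purely for illustration.
tiles = {
    "tile_1": [1, 2, 3, 4, 5, 6, 7, 8],
    "tile_2": [1, 2, 3, 4, 5, 6, 7, 8],  # true duplicate of tile_1
    "tile_3": [5, 6, 7, 8, 1, 2, 3, 4],  # same histogram, pixels swapped
    "tile_4": [9, 9, 9, 9, 9, 9, 9, 9],  # unrelated image
}
print(group_by_histogram(tiles))
# → [['tile_1', 'tile_2', 'tile_3']]
```

Here tile_3 is grouped with the true duplicates even though its pixels are in a different order, which is why a histogram match is better treated as a first filter than as proof.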
You basically need to compare each image to all the others (irrespective of name) to make sure that there aren't duplicates. I wonder if doing checksum comparisons might work here. See the CRCCalculator.
Alternatively, send them to the Matcher, though that might be considerably slower.
EDIT: If the tiles have embedded georeferencing, they won't be detected as matches even if they are pixel-wise identical, so you'll want to strip that before either transformer (RasterGeoreferencer with Upper Left set to 0 and spacing set to 1).
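For anyone wanting to prototype the checksum idea outside FME, here is a minimal sketch. It assumes the raw pixel bytes of each tile have already been extracted without their georeferencing or file name (the `tiles` dictionary and `find_duplicate_tiles` are hypothetical names, and the extraction step is not shown). It uses Python's standard `zlib.crc32` rather than the CRCCalculator transformer, and confirms each checksum match byte for byte because CRC32 values can collide.

```python
import zlib
from collections import defaultdict


def find_duplicate_tiles(tiles):
    """Return groups of tile names whose pixel bytes are identical.

    Assumption: `tiles` maps a tile name to that tile's raw pixel bytes only,
    so georeferencing and names cannot make identical images look different.
    """
    by_checksum = defaultdict(list)
    for name, pixels in tiles.items():
        # Bucket by length and CRC32 so each tile is checksummed only once.
        by_checksum[(len(pixels), zlib.crc32(pixels))].append(name)

    duplicates = []
    for names in by_checksum.values():
        if len(names) < 2:
            continue
        # Confirm candidates with a full comparison in case of a CRC collision.
        groups = []
        for name in names:
            for group in groups:
                if tiles[group[0]] == tiles[name]:
                    group.append(name)
                    break
            else:
                groups.append([name])
        duplicates.extend(sorted(g) for g in groups if len(g) > 1)
    return sorted(duplicates)


# Hypothetical pixel buffers standing in for real tile data.
tiles = {
    "tile_A1": bytes([10, 20, 30, 40] * 4),
    "tile_A2": bytes([11, 21, 31, 41] * 4),
    "tile_B7": bytes([10, 20, 30, 40] * 4),  # same image under another name
}
print(find_duplicate_tiles(tiles))
# → [['tile_A1', 'tile_B7']]
```

Bucketing by checksum first avoids comparing every tile against every other tile directly, which is the likely reason the Matcher route would be slower on a county-wide set.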
This doesn't help if your tiles are in some sort of slippy map and all have the same properties.