Solved

Matrix of all unique values per attribute field of a dataset

7 years ago
January 31, 2018
5 replies
94 views

miikamakela
19 replies

Hi,

I have a new source with poor metadata. I would like to create a quick overview of the dataset by producing a matrix, where I have each attribute field and all unique values within that field listed. With aggregator or duplicate filter I can easily do one. But I have close to 100 fields, so there must be a more clever way...

Thanks for any suggestions!

Best answer by nielsgerrits

AttributeExploder to create features of cells then aggregate by _attribute_name.

View original

Did this help you find an answer to your question?

+50

redgeographics
Celebrity
3647 replies
7 years ago
January 31, 2018

If you use a Sampler, set it to pass only the first feature and then group it by all the fields you want to check it should output all unique attribute combinations for those fields. Not quite what you're looking for, but getting there.

miikamakela
Author
19 replies
7 years ago
January 31, 2018

redgeographics wrote:

thanks! I was thinking something similar with the aggregator, but indeed the answer is not quite there yet. need to remove all field level duplicates as there are a few million rows...

miikamakela
Author
19 replies
7 years ago
January 31, 2018

Currently thinking in the lines of "Compute histograms" within the StatisticsCalculator transformer. But have not figured the way yet...

+56

nielsgerrits
2870 replies
Best Answer
7 years ago
January 31, 2018

AttributeExploder to create features of cells then aggregate by _attribute_name.

miikamakela
Author
19 replies
7 years ago
January 31, 2018

nielsgerrits wrote:

AttributeExploder to create features of cells then aggregate by _attribute_name.

Yes! this is exactly what I wanted. Thanks!

Reply

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

Matrix of all unique values per attribute field of a dataset