Skip to main content
Solved

Matrix of all unique values per attribute field of a dataset

  • January 31, 2018
  • 5 replies
  • 127 views

Forum|alt.badge.img+1

Hi,

I have a new source with poor metadata. I would like to create a quick overview of the dataset by producing a matrix, where I have each attribute field and all unique values within that field listed. With aggregator or duplicate filter I can easily do one. But I have close to 100 fields, so there must be a more clever way...

Thanks for any suggestions!

Best answer by nielsgerrits

AttributeExploder to create features of cells then aggregate by _attribute_name.

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

5 replies

redgeographics
Celebrity
Forum|alt.badge.img+60
  • Celebrity
  • 3703 replies
  • January 31, 2018

If you use a Sampler, set it to pass only the first feature and then group it by all the fields you want to check it should output all unique attribute combinations for those fields. Not quite what you're looking for, but getting there.


Forum|alt.badge.img+1
  • Author
  • 19 replies
  • January 31, 2018

If you use a Sampler, set it to pass only the first feature and then group it by all the fields you want to check it should output all unique attribute combinations for those fields. Not quite what you're looking for, but getting there.

thanks! I was thinking something similar with the aggregator, but indeed the answer is not quite there yet. need to remove all field level duplicates as there are a few million rows...

 

 


Forum|alt.badge.img+1
  • Author
  • 19 replies
  • January 31, 2018
Currently thinking in the lines of "Compute histograms" within the StatisticsCalculator transformer. But have not figured the way yet...

 

 


nielsgerrits
VIP
Forum|alt.badge.img+60
  • 2940 replies
  • Best Answer
  • January 31, 2018

AttributeExploder to create features of cells then aggregate by _attribute_name.


Forum|alt.badge.img+1
  • Author
  • 19 replies
  • January 31, 2018

AttributeExploder to create features of cells then aggregate by _attribute_name.

Yes! this is exactly what I wanted. Thanks!