Solved

Group By and pick up maximum value rows

7 years ago
August 8, 2017
8 replies
398 views

submi
10 replies

Hi guys,

I have data which look like this:

|ID|Number|

|1|1|

|1|3|

|2|1|

|3|1|

|3|2|

|3|3|

I need to group them by ID and pick up only rows with highest Number value. To get such data:

|ID|Number|

|1|3|

|2|1|

|3|3|

Any idea how to accomplish this?

Thanks

Best answer by ebygomm

Sort by number in descending order and then a duplicate filter on the ID would also work. ID with highest number value would be output via Unique port

View original

Did this help you find an answer to your question?

david_r
8356 replies
7 years ago
August 8, 2017

Here's one possible solution:

Aggregator with Group By on ID, generate list "Numbers"
ListSorter descending on list Numbers{}
ListIndexer to retrieve item 0 (highest value) from list Numbers{}

Am sure there are others.

submi
Author
10 replies
7 years ago
August 8, 2017

david_r wrote:

Here's one possible solution:

Aggregator with Group By on ID, generate list "Numbers"
ListSorter descending on list Numbers{}
ListIndexer to retrieve item 0 (highest value) from list Numbers{}

Am sure there are others.

It seems to work fine after fast check. Thanks

+39

ebygomm
Influencer
3330 replies
Best Answer
7 years ago
August 8, 2017

Sort by number in descending order and then a duplicate filter on the ID would also work. ID with highest number value would be output via Unique port

submi
Author
10 replies
7 years ago
August 8, 2017

ebygomm wrote:

Sort by number in descending order and then a duplicate filter on the ID would also work. ID with highest number value would be output via Unique port

Seems simpler, didn't know about this highest value in duplicated filter funcianality.

david_r
8356 replies
7 years ago
August 8, 2017

submi wrote:

Seems simpler, didn't know about this highest value in duplicated filter funcianality.

It's because the DuplicateFilter preserves the input order, which in this case has been established by the Sorter. Pretty elegant solution, I agree.

+39

ebygomm
Influencer
3330 replies
7 years ago
August 8, 2017

You could also use a sorter to sort in descending order followed by the sampler with a group by on the ID field and a sampling rate of 1 with a sampling type of First N Features

takashi
7726 replies
7 years ago
August 8, 2017

I agree that the combination of a Sorter and a DuplcateFilter (or a Sampler) is a very elegant way, but there should be several ways as @david_r mentioned. These four ways flashed into my mind so far.

Basic statistics: StatisticsCalculator (Group By: ID, Attributes to Analyze: Number, Maximum Attribute: Number)
SQL application: InlineQuerier
XQuery & JSON application: Sampler(or DuplicateFilter) + JSONTemplater + JSONFlattener
Tcl application: Aggregator + TclCaller

FYI.

+49

mark2atsafe
Safer
2523 replies
7 years ago
August 8, 2017

takashi wrote:

Basic statistics: StatisticsCalculator (Group By: ID, Attributes to Analyze: Number, Maximum Attribute: Number)
SQL application: InlineQuerier
XQuery & JSON application: Sampler(or DuplicateFilter) + JSONTemplater + JSONFlattener
Tcl application: Aggregator + TclCaller

FYI.

The StatisticsCalculator was what occurred to me first too. I like that way. But like you say there are so many methods.

Reply

Rich Text Editor, editor1

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

Group By and pick up maximum value rows