
Hi,

A user's progress is being tracked by creating new rows of data, with an incrementing counter in a column, like so:

I want to keep only the rows with the highest counter value for each user (A, 5; B, 3; etc.), i.e. only the highlighted rows. How can I do this in Workbench?

Thanks!

Have a look at the StatisticsCalculator: group by user, and the _max attribute will hold the required value.

Use the SUMMARY output port.
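
In case it helps to see the logic outside Workbench, here is a rough Python sketch of what grouping by user and taking the maximum amounts to. It is not FME code. The function name `max_counter_per_user` and the sample rows are made up for illustration, loosely following the A, 5 / B, 3 example in the question.

```python
def max_counter_per_user(rows):
    """Return {user: highest counter} for an iterable of (user, counter) pairs."""
    best = {}
    for user, counter in rows:
        # Keep the counter only if it is the first or the largest seen for this user.
        if user not in best or counter > best[user]:
            best[user] = counter
    return best


# Hypothetical sample data: one row per progress update.
rows = [
    ("A", 1), ("A", 2), ("A", 3), ("A", 4), ("A", 5),
    ("B", 1), ("B", 2), ("B", 3),
]

summary = max_counter_per_user(rows)
for user, counter in sorted(summary.items()):
    print(user, counter)
# prints:
# A 5
# B 3
```

Each entry in the result corresponds to one summary row per group, which is roughly what I would expect to come out of the summary port when a Group By is set.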


@mvdorst, there are a number of ways you could do this, but I think the easiest is probably a StatisticsCalculator transformer: analyse the counter value (the second column) and calculate its maximum, with a Group By set on the name (the first column).

Example workflow below:

 

