Solved

Aggregate table: Only keeping maximum values

4 years ago
December 9, 2020
7 replies
178 views

manumanul
6 replies

I have a point feature containing different entries per point (multiple measurements on different heights), and i want to aggregate my attribute table in a way that only one entry per X,Y-point exist.

The final entry should thereby only contain the maximum values of all measurements per points.

I tried using the Aggregator tool, however, I was unable to keep the maximum values, since the only implemented functions seem to be average and sum.

I think of using my own python script withih a PythonCaller, but it did not work either (I know it must work , but I am pretty new to both FME and Python and just did not succeed).

Any help much appreciated!

Best answer by ebygomm

manumanul wrote:

My data is rather like this...

In which case, you should just be able to use a StatisticsCalculator, Group by whatever ID you have for each point, choose the Measurements you want to analyse and get the _max

Capture You'd end up with something like this from the Summary port

Capture

View original

Did this help you find an answer to your question?

manumanul
Author
6 replies
4 years ago
December 9, 2020

I am more experienced with R, and using R I would solve it as follows:

# define data frame with test data
df=data.frame(
  name1=c("a","a","b","b","b"),
  name2=c("a","a","a","b","b"),
  value1=c(1,3,6,-9,10),
  value2=c(2,3,6,9,1),
  value3=c(10,3,3,29,2),
  value4=c(-1,34,6,9,6),
  value5=c(1,3,6,91,2)
)
 
# concatenate columns (as i want to group the df by 2 different cols)
df$list=paste0(df$name1,df$name2)
 
# aggregate df
df.aggregated=aggregate(x=df[,-c(1,2)],by=list(df$list),FUN=max)[,-1]
 
# re-add data of the two columns that the df was grouped by
comblist=match(df.aggregated$list,df$list)
combsorted=df[comblist,]
df.aggregated$name1=combsorted$name1
df.aggregated$name2=combsorted$name2

However, I did not manage to get RCaller work in FME...

+39

ebygomm
Influencer
3313 replies
4 years ago
December 9, 2020

Can you confirm, you have some data that looks like this Capture and you're aiming to get to something like this?

Capture

manumanul
Author
6 replies
4 years ago
December 9, 2020

ebygomm wrote:

Can you confirm, you have some data that looks like this Capture and you're aiming to get to something like this?

Capture

My data is rather like this...

1 Attachments

+39

ebygomm
Influencer
3313 replies
Best Answer
4 years ago
December 9, 2020

manumanul wrote:

My data is rather like this...

In which case, you should just be able to use a StatisticsCalculator, Group by whatever ID you have for each point, choose the Measurements you want to analyse and get the _max

Capture You'd end up with something like this from the Summary port

Capture

manumanul
Author
6 replies
4 years ago
December 9, 2020

manumanul wrote:

I am more experienced with R, and using R I would solve it as follows:

# define data frame with test data
df=data.frame(
  name1=c("a","a","b","b","b"),
  name2=c("a","a","a","b","b"),
  value1=c(1,3,6,-9,10),
  value2=c(2,3,6,9,1),
  value3=c(10,3,3,29,2),
  value4=c(-1,34,6,9,6),
  value5=c(1,3,6,91,2)
)
 
# concatenate columns (as i want to group the df by 2 different cols)
df$list=paste0(df$name1,df$name2)
 
# aggregate df
df.aggregated=aggregate(x=df[,-c(1,2)],by=list(df$list),FUN=max)[,-1]
 
# re-add data of the two columns that the df was grouped by
comblist=match(df.aggregated$list,df$list)
combsorted=df[comblist,]
df.aggregated$name1=combsorted$name1
df.aggregated$name2=combsorted$name2

However, I did not manage to get RCaller work in FME...

By specifying the R interpreter (see http://docs.safe.com/fme/html/FME_Desktop_Documentation/FME_Transformers/Transformers/rcaller.htm) and the installation of many R packages (see https://community.safe.com/s/question/0D54Q000080hcFESAY/rcaller-no-package-called-sqldf) I finally managed to get my R code working.

The code above worked and did exactly what I was looking for. I just had to adjust my input dataframe (instead of my test-df I defined df as my input data from the previous FME Transformer)

df=Passed

and define an output at the end of my code

fmeOutput<-df.aggregated

However, I lost my spatial references on the way and thus had to redefine this, which was no problem...

1 Attachments

manumanul
Author
6 replies
4 years ago
December 9, 2020

manumanul wrote:

My data is rather like this...

Perfect! That's exactly what I was looking for. However, is there a possibility to keep the other attributes (those of which I did not calculate any statistics)?

+39

ebygomm
Influencer
3313 replies
4 years ago
December 9, 2020

manumanul wrote:

My data is rather like this...

If the values are shared across the features, you can add them to the group by. If they vary across features, you need to decide how you will decide which value to keep. If you look at the data coming from the Complete port, that will contain the min and max as extra attributes against the original data.

Reply

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

Aggregate table: Only keeping maximum values

1 Attachments

7 replies

1 Attachments

1 Attachments

Reply

Helpful Members This Week

Recently Solved Questions

How to get a list of Asana tasks with their corresponding custom field values?

Using one AttributeRounder for different accuracies

Create date segments of two table with overlap of times

Automate Fanout of columns/splitting attributes to different output by attribute name

Tracing Multiple Networks from Sources to Valves Without Python

Community Stats

Latest FME

Cookie policy

Cookie settings

1 Attachments

1 Attachments

1 Attachments

Reply

Related Topics

StoreKitTest rereceipt is not valid erroricon

Error: The receipt is not valid on Catalysticon

iOS 'Receipt is invalid' Megathread

Not seeing sandbox purchases (non-subscription)icon

Receipt is not valid when purchasing package in Sandboxicon

Helpful Members This Week

Recently Solved Questions

How to get a list of Asana tasks with their corresponding custom field values?

Using one AttributeRounder for different accuracies

Create date segments of two table with overlap of times

Automate Fanout of columns/splitting attributes to different output by attribute name

Tracing Multiple Networks from Sources to Valves Without Python

Popular Tags

Community Stats

Latest FME

Sign up

An FME Account is required to contribute

Login to the community

An FME Account is required to contribute

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings