Solved

DuckDB Summarize command

8 months ago
November 2, 2024
4 replies
97 views

+12

oliver.morris
Contributor
176 replies

I am trying to use the duckdb summarize command

https://duckdb.org/docs/archive/0.9/guides/meta/summarize

in an in memory duckdb using the sqlexecutor:

SUMMARIZE SELECT * FROM read_parquet('/Users/x/Downloads/Address.parquet');

This doesn’t return any results/attributes but ‘runs’ successfully in FME. In DBeaver it runs no issues.

Any idea is there is a trick to get this working?

Many ThanksQ!

Best answer by arnold_bijlsma

@oliver.morris : There is a very simple workaround: wrap the SUMMARIZE or DESCRIBE query in another SELECT * FROM clause:

SELECT * FROM (SUMMARIZE SELECT * FROM read_csv('https://oedi-data-lake.s3.amazonaws.com/pvdaq/csv/systems.csv') );

And in DuckDB’s SQL dialect you don’t explicitly need the “SELECT *” or the read_csv() function, so this could be abbreviated to:

FROM ( SUMMARIZE FROM 'https://oedi-data-lake.s3.amazonaws.com/pvdaq/csv/systems.csv' );

View original

Did this help you find an answer to your question?

+26

bwn
Evangelist
562 replies
8 months ago
November 3, 2024

Have you exposed the Attributes on the Workspace?

“SELECT *” gives no clues to SQLExecutor as to what Workspace Attribute Names to create for generated Workspace Features. This will need to be set in the Attributes to Expose Parameter, or in a downstream AttributeExposer

If you however view the results in Data Inspector you will see on any individual Feature in the Feature Information any unexposed Attribute values.

+14

arnold_bijlsma
Enthusiast
123 replies
7 months ago
December 6, 2024

@oliver.morris : I’m getting the same: no output when using SUMMARIZE, not even unexposed output.

And its sister statement DESCRIBE even gives an error:

+14

arnold_bijlsma
Enthusiast
123 replies
Best Answer
7 months ago
December 6, 2024

@oliver.morris : There is a very simple workaround: wrap the SUMMARIZE or DESCRIBE query in another SELECT * FROM clause:

SELECT * FROM (SUMMARIZE SELECT * FROM read_csv('https://oedi-data-lake.s3.amazonaws.com/pvdaq/csv/systems.csv') );

And in DuckDB’s SQL dialect you don’t explicitly need the “SELECT *” or the read_csv() function, so this could be abbreviated to:

FROM ( SUMMARIZE FROM 'https://oedi-data-lake.s3.amazonaws.com/pvdaq/csv/systems.csv' );

+12

oliver.morris
Author
Contributor
176 replies
7 months ago
December 8, 2024

arnold_bijlsma wrote:

@oliver.morris : There is a very simple workaround: wrap the SUMMARIZE or DESCRIBE query in another SELECT * FROM clause:

SELECT * FROM (SUMMARIZE SELECT * FROM read_csv('https://oedi-data-lake.s3.amazonaws.com/pvdaq/csv/systems.csv') );

And in DuckDB’s SQL dialect you don’t explicitly need the “SELECT *” or the read_csv() function, so this could be abbreviated to:

FROM ( SUMMARIZE FROM 'https://oedi-data-lake.s3.amazonaws.com/pvdaq/csv/systems.csv' );

@arnold_bijlsma nice solution, very cool - I think this means I can build a little hub transformer to summarise a whole set of file types super quick!

Reply

Rich Text Editor, editor1

Cookie policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos + marketing

DuckDB Summarize command

4 replies

Reply

Helpful Members This Week

Recently Solved Questions

How to restart a REST Server in ArcGIS Server?

Remove last CR/LF from a CSV

1019 error with change detector and polygons

Where is the "Show Bookmark Navigator" option in FME 2024.2?

How to dynamically write new or update existing ArcGIS Online Feature Layers.

Community Stats

Latest FME

Cookie policy

Cookie settings

Reply

Related Topics

Create error report containing failed tester attributesicon

Attribute Validator - Populate _fme_validation_message (string) with a concatena

How to to find empty fields in columns ?icon

Updating Multipoint Feature on ArcGIS Onlineicon

Dynamic buffer size from polygonicon

Helpful Members This Week

Recently Solved Questions

How to restart a REST Server in ArcGIS Server?

Remove last CR/LF from a CSV

1019 error with change detector and polygons

Where is the "Show Bookmark Navigator" option in FME 2024.2?

How to dynamically write new or update existing ArcGIS Online Feature Layers.

Popular Tags

Community Stats

Latest FME

Sign up

An FME Account is required to contribute

Login to the community

An FME Account is required to contribute

Scanning file for viruses.

This file cannot be downloaded

Cookie policy

Cookie settings