Gathering Interest

New Reader/Writer: Apache Iceberg

Related products: Integrations
  • September 12, 2023
  • 9 replies
  • 220 views

t_hohenstrater
Contributor

Level up to the Parquet format (updates and snapshots)

LizAtSafe
Safer
  • October 29, 2024
Open → Gathering Interest

LizAtSafe
Safer
  • November 4, 2025
Gathering Interest → In Development

adrian_farrell
Contributor

Is there any update on a planned release of this reader/writer?


LizAtSafe
Safer
  • March 4, 2026
In Development → Archived

LizAtSafe
Safer
  • March 4, 2026
Archived → Open

PierreAtSafe
Safer

@adrian_farrell Sorry for the delayed reply here, Adrian! We hit a little snag on the Iceberg front, but it’s still on the roadmap for sometime later this year.

Cheers


LizAtSafe
Safer
  • March 31, 2026
Open → Gathering Interest

PierreAtSafe
Safer

Hi everyone,

We are back in active planning on the Reader and Writer, and I want to make sure we build what you actually need first. If you are evaluating Iceberg or already using it, I would love concrete detail on your setup and the workflows you want FME to cover.

A few specific things that would help us prioritize:

  1. Read, write, or both? If both, which one matters more?
  2. Which catalog are you on (or planning to use): REST, AWS Glue, Hive Metastore, Nessie, Snowflake Open Catalog / Polaris, Unity, BigQuery, other?
  3. Which engines also touch these tables: Spark, Trino, Flink, Snowflake, BigQuery, Dremio, DuckDB, Athena, other?
  4. Where does the data live: S3, ADLS, GCS, MinIO, on-prem HDFS / POSIX?
  5. Rough table scale: rows, partitions, typical file sizes, how often you write.
  6. Write patterns you need: bulk load, append, full overwrite, row-level UPDATE / DELETE, MERGE / upsert, streaming?
  7. Geospatial: are you storing geometry in Iceberg today? If so, how: as WKB in a binary column, as GeoParquet, or are you waiting on the native v3 geometry type?
  8. Any must-have features beyond the above: time travel, branches / tags for Write-Audit-Publish, partition or schema evolution, specific transforms?

An example workflow you are trying to build (source to Iceberg, Iceberg to target, or both) is worth more than a checklist answer. If your setup is sensitive, feel free to DM instead.

Thanks for the patience on this one. The more real use cases we hear, the better the first release will be.

Cheers,