Skip to main content
Open

PythonCreator to support bulk mode output

Related products:FME FormIntegrationsTransformers
  • November 7, 2025
  • 3 replies
  • 109 views

bruceharold
Supporter
Forum|alt.badge.img+19

With libraries like DuckDB in your Python environment, as is the case where you either have ArcGIS Pro or Enterprise on your FME machine, or you’re using ArcGIS Data Interoperability, then it is easy to retrieve potentially millions of features from an S3-API compliant object store or other web source in seconds, but it’s a bottleneck to send them on into the workspace one by one.

I would like to see performance like reading CSV files brought to PythonCreator.

There is a possibly related existing idea Introduce a Python Dataframe Creator/Transformer but I don’t want to confound using dataframes with this idea, which is fundamentally about performance.

Dataframes might be how this idea is implemented, but my guess is that would be a heavy lift for Safe, another way might be a way to output an aggregate feature.

3 replies

PierreAtSafe
Safer
Forum|alt.badge.img+8
NewOpen

bruceharold
Supporter
Forum|alt.badge.img+19
  • Author
  • Supporter
  • November 10, 2025

I refactored a test workspace to use DuckDB in a PythonCaller, which has a class method to return support for bulk mode, but using a feature list (like in the PythonCaller help) to read what DuckDB read from source filled memory (16 million point features) so wasn’t really a solution.  Backing off to a cursor worked.  If we could go directly from a DuckDB relation to a feature table we might avoid the memory issues and get features in 100K chunks.


bruceharold
Supporter
Forum|alt.badge.img+19
  • Author
  • Supporter
  • November 10, 2025

Now I discover AttributeKeeper can create bulk mode features downstream, which may help...