Support for apache arrow in memory data exchange format as input to Pythn script caller and allow an apache arrow object to be returned by the python caller as well would be great. There would be a number of advantages to this.
- Remove the need to install fme objects into my python environment.
- Make the python interface cleaner and easier to use. Just convert the apacje arrow object into your data frame, Reducing the barrier to using python in workflows.
- Scripts could more easily be developed and tested in IDEs like vscode, and pycharm making the development of scripts far nicer.
- Supported by many libraries. e.g. tensorflow, pandas, polars, duckdb, pyspark etc.
- Would make it easier to embed machine learning models into workflows.
- Potentially opens up the integration of other languages into FME due to the arrow format being language an gnostic.