I've been working on a new workflow and in 2020 I've been excited to see the increased support for Bulk Mode - It's awesome.
However, I feel like in the workflow that I've been creating Bulk Mode has been causing some performance issues.
In a few places in the log I see things like this
12020-06-23 09:47:51| 5.8| 1.4|INFORM|Aggregator_4 (AggregateFactory): Group 1 / 1: Preparing to divide 11 features into groups22020-06-23 09:47:52| 6.1| 0.3|INFORM|Aggregator_4 (AggregateFactory): Group 1 / 1: Dividing 11 features into groups32020-06-23 09:47:52| 6.3| 0.2|INFORM|Aggregator_4 (AggregateFactory): Group 1 / 1: Splitting bulk mode features into features42020-06-23 09:47:53| 8.0| 1.8|INFORM|Aggregator_6 (AggregateFactory): Group 1 / 1: Preparing to divide 4 features into groups52020-06-23 09:47:54| 8.5| 0.5|INFORM|Aggregator_6 (AggregateFactory): Group 1 / 1: Dividing 4 features into groups62020-06-23 09:47:54| 8.7| 0.2|INFORM|Aggregator_6 (AggregateFactory): Group 1 / 1: Splitting bulk mode features into features72020-06-23 09:48:05| 19.9| 11.2|INFORM|SQLExecutor_15 (QueryFactory): Splitting bulk mode features into features8
The SQLExecutor here take 11 seconds - Here it's one feature going in and ~20,000 coming out (I'm merging attributes)
Admittedly there are quite a large number of attributes that I'm working with here.
There is no way for me to only process in Bulk Mode due to the transformers not being supported
What I'm after is for some tips that people might have to get the most out of bulk mode and reduce (as much as possible) the effects from splitting the features up.
Thanks in advance