I've been working on a new workflow and in 2020 I've been excited to see the increased support for Bulk Mode - It's awesome.
However, I feel like in the workflow that I've been creating Bulk Mode has been causing some performance issues.
In a few places in the log I see things like this
2020-06-23 09:47:51| 5.8| 1.4|INFORM|Aggregator_4 (AggregateFactory): Group 1 / 1: Preparing to divide 11 features into groups
2020-06-23 09:47:52| 6.1| 0.3|INFORM|Aggregator_4 (AggregateFactory): Group 1 / 1: Dividing 11 features into groups
2020-06-23 09:47:52| 6.3| 0.2|INFORM|Aggregator_4 (AggregateFactory): Group 1 / 1: Splitting bulk mode features into features
2020-06-23 09:47:53| 8.0| 1.8|INFORM|Aggregator_6 (AggregateFactory): Group 1 / 1: Preparing to divide 4 features into groups
2020-06-23 09:47:54| 8.5| 0.5|INFORM|Aggregator_6 (AggregateFactory): Group 1 / 1: Dividing 4 features into groups
2020-06-23 09:47:54| 8.7| 0.2|INFORM|Aggregator_6 (AggregateFactory): Group 1 / 1: Splitting bulk mode features into features
2020-06-23 09:48:05| 19.9| 11.2|INFORM|SQLExecutor_15 (QueryFactory): Splitting bulk mode features into features
The SQLExecutor here take 11 seconds - Here it's one feature going in and ~20,000 coming out (I'm merging attributes)
Admittedly there are quite a large number of attributes that I'm working with here.
There is no way for me to only process in Bulk Mode due to the transformers not being supported
What I'm after is for some tips that people might have to get the most out of bulk mode and reduce (as much as possible) the effects from splitting the features up.
Thanks in advance