Hi, we are seeing some odd behaviour in our FME Server 2015.1 instance
We are running a 'runner' workbench, that uses 3 chained fmeserverjobsubmitter transfomers to call 3 'workers' and all are configured in 'sync' mode (i.e. wait for the job to finish)
Occasionally the first worker (which can take just over an hour to run) is submitted twice. The timing of the second submission appears to be random and between 10-50 minutes.
the driver workbench log shows no record of triggering this second job and appears to fail with the error
(ServerJobSubmissionFactory): A connection error occurred with server "http://fme-server.prod/fmeserver"
both the workers appear to complete successfully sometime after the runner has failed.
Does anyone know what conditions might be causing this automatic resubmission of a job and, ideally, how to prevent it?
I suspect that some component of FME Server is losing connectivity with the initial job, assuming failure, trying again and losing connectivity with the second subsequent job and finally quitting.
Luckily this is just an extract type process, so not causing database issues.
regards,
Nick
p.s. (we are working on an upgrade path, but this 'fix' option is not currently available!)