Question

Job Cancel and engine restart

  • 9 June 2023
  • 6 replies
  • 21 views

Hi,

I have a question about cancelling jobs in the FME Server UI. I have a running job that I need to cancel partway through. When I select the job and click the Cancel button, the job is cancelled but the engine restarts, and all the other jobs in the queue fail immediately.

 

Is there a way to cancel a running job without restarting the engine, so that the next job in the queue starts running?

 

I realise we could set "Running Job Expiry Time" to cancel a job after a specific maximum running time. But if I set it, will the engine be restarted when a job is cancelled for exceeding that time?

 

Thanks in advance for any advice.


6 replies

Userlevel 5
Badge +26

That's definitely not what should be happening. Cancelling a running job should not have any effect on the rest of the queue (there is a regular engine reboot after a set number of jobs, if I recall correctly, but that too should not have any effect on the queued jobs).

 

This sounds like something you should send in to support, either Safe or your local reseller, to take a look at.

Userlevel 5

I completely agree with @Hans van der Maarel, this is unexpected behavior.

You could start by verifying the value of MAX_TRANSACTION_RESULT_FAILURES in fmeServerConfig.txt; it should be set to 10 (the default). Any other value suggests that someone saw a need to modify it, which could point to a related problem.
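If you'd rather script that check than open the file by hand, here is a minimal Python sketch. It assumes fmeServerConfig.txt uses plain KEY=VALUE lines with `#` comments; the helper names `read_setting` and `check_max_failures` are made up for illustration, and the file's location depends on your installation, so you would read its contents yourself and pass them in.

```python
from typing import Optional


def read_setting(config_text: str, key: str) -> Optional[str]:
    """Return the last value assigned to `key` in KEY=VALUE text, or None.

    Assumption: one KEY=VALUE pair per line, '#' starts a comment line.
    """
    value = None
    for raw_line in config_text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        name, sep, rest = line.partition("=")
        if sep and name.strip() == key:
            value = rest.strip()  # later assignments override earlier ones
    return value


def check_max_failures(config_text: str, expected: str = "10") -> str:
    """Classify MAX_TRANSACTION_RESULT_FAILURES as default, modified, or not set."""
    value = read_setting(config_text, "MAX_TRANSACTION_RESULT_FAILURES")
    if value is None:
        return "not set"
    if value == expected:
        return "default"
    return f"modified ({value})"
```

You would call `check_max_failures(open(path).read())` with the path to your own fmeServerConfig.txt; anything other than "default" is worth a closer look.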

@Hans van der Maarel and @david_r, thank you both for the reply. :)

So cancelling a running job shouldn't cause the engine to restart? I will check with our architect to get more information.

If cancelling a job doesn't trigger an engine restart, can the Running Job Expiry Time parameter be used to control the timeout for each job? [image.png] Is there any way to automatically re-run a timed-out job within a certain period?

 

Again, thanks heaps

 

Userlevel 5
Badge +26

Cancelling a job shouldn't cause an engine restart (unless FME Server was planning to do that already) and it certainly shouldn't cause queued jobs to disappear.

 

You can use the Running Job Expiry Time to automatically cancel a job that runs too long, but that won't resubmit it (because it would most likely take just as much time and then cause it to be cancelled again, putting you in an infinite loop).
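If you do want automatic re-runs, you would have to script the submission yourself and cap the number of attempts so it can't loop forever. A rough Python sketch of that idea follows; `submit_job` is a hypothetical callable you would write (for example around whatever you use to submit jobs and wait for the result), and the "TIMED_OUT" and "SUCCESS" strings are placeholders, not actual FME Server status values.

```python
from typing import Callable, Tuple


def run_with_retry_cap(
    submit_job: Callable[[], str], max_attempts: int = 3
) -> Tuple[str, int]:
    """Call submit_job until it stops timing out or attempts run out.

    Returns (final_status, attempts_used). "TIMED_OUT" is a placeholder
    status chosen for this sketch.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    status = "TIMED_OUT"
    for attempt in range(1, max_attempts + 1):
        status = submit_job()  # hypothetical: submits the job and waits for it
        if status != "TIMED_OUT":
            return status, attempt  # finished (successfully or not), stop retrying
    return status, max_attempts  # still timing out: give up instead of looping
```

With a cap like this, a job that always exceeds the expiry time is tried a fixed number of times and then left for someone to investigate.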

Userlevel 5
Badge +28

This does sound odd for sure - it would be interesting to know what kind of FME Server setup you're working with and which version of FME Server you're running.

If you are running on a system with scaling engines, it might be that the engine running the job is a temporary one, and (maybe) its queue is too.

Typically, if a job is in a queue with no engine, it just gets stuck, but maybe there is some kind of cleanup job that kills any stranded jobs?

It doesn't explain the engine restart, but perhaps MAX_TRANSACTION_RESULT_FAILURES is set to 1; it's worth checking, as @david_r suggested.

 

Either way, it's not normal behaviour, and you should be able to cancel a running job without affecting other jobs in the queue.

@Hans van der Maarel and @virtualcitymatt, thanks heaps for your replies. We checked fmeServerConfig.txt, and MAX_TRANSACTION_RESULT_FAILURES is 10, the default. But when I test cancelling via the expiry time, it still tries to restart the engine, like this. We are using a high-availability architecture in which the engine VM and the core VM are separate VMs. Do you think this could be the issue? [image]
