I am on FME Server 2022.2.8, and I have a fairly large Automation where the workspaces communicate through JSON messages. I have been able to reproduce a crash on two machines, but I haven’t been able to isolate the exact cause. I think it might be related to https://support.safe.com/hc/en-us/articles/25407442818189-Known-Issue-FME-Server-Core-java-lang-OutOfMemoryError-Java-heap-space
If I start processing that involves about 100 workspaces queued and running with quite long JSON messages, the REST API eventually stops responding (about 30 minutes in, it starts returning 500 errors). Processing seems to continue and the Automation keeps running fine, and the Server UI seems OK (I think it relies on REST API v4).
Restarting FME Server (or the whole machine) does not fix the problem, but cleaning up rows in the fme_job_history table seems to help.
I have worked around the problem by re-architecting the Automation to make the messages smaller, so we are in the clear now.
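For anyone who wants to try the same kind of workaround, here is a minimal Python sketch of one way to keep messages small: write large payloads to a shared folder and pass only a reference in the message. This is an illustration only, not the actual change made in the Automation. The names `shrink_message`, `load_message`, `payload_ref`, `payload_bytes` and the 8 KB threshold are all hypothetical, and nothing here uses an FME API.

```python
import json
import os
import uuid

# Hypothetical threshold; pick whatever "small" means for your setup.
MAX_INLINE_BYTES = 8 * 1024


def shrink_message(message, spool_dir, max_inline_bytes=MAX_INLINE_BYTES):
    """Return the message unchanged if it is small, else a short reference to it."""
    encoded = json.dumps(message, separators=(",", ":")).encode("utf-8")
    if len(encoded) <= max_inline_bytes:
        return message
    # Large payload: store it on disk and send only a pointer.
    path = os.path.join(spool_dir, f"{uuid.uuid4().hex}.json")
    with open(path, "wb") as fh:
        fh.write(encoded)
    return {"payload_ref": path, "payload_bytes": len(encoded)}


def load_message(message):
    """Resolve a reference created by shrink_message back into the full payload."""
    ref = message.get("payload_ref") if isinstance(message, dict) else None
    if ref is None:
        return message
    with open(ref, "rb") as fh:
        return json.loads(fh.read().decode("utf-8"))
```

The sending workspace would call something like `shrink_message` before emitting its message, and the receiving workspace would call `load_message` first. The spool folder has to be readable by every engine, and old payload files need to be cleaned up separately.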
Posting this in case anyone else runs into similar problems!