Hello, the other day I found many timers that didn’t fire when at their duedate (september 10th). Yesterday I tested those same processes just changing the timer’s value. I tried with 1 minute, 1 hour and 5 hours and all of them worked perfectly.
So I though maybe the server restarting is messing all up. So today I launched the processes with 1 hour timer again and restarted the server (tomcat7) afterwards. One hour after I see all the processes waiting for the timer and all the jobs in the database with their duedates in the past. Then I relaunched the processes with 1 minute timers expecting them to work because I’m not restarting the server again, but they get stuck as well.
I have no clue what may be happening. Any help is appreciated.
My configuration of the jobExecutor in bpm-platform.xml:
after the timers fire, are you routing to a service task which is using a connector to make a remote procedure call? If so, check that the remote end points are not blocking etc.
Hello, the rest of the process is fine, it connects to a web service and send some emails, but it works correctly. In many of my tests has worked without problem. The job is not executing, I just checked the logs of today. 21 processes should have fire their timers around 4:30 AM. The log showed the server restart at 1 AM and the next trace is from 9 AM. Both the engine and the endpoint it connects after the timer are in the same server and both of them catch all exceptions and log them. I also include execution listeners at the start and end of every element and log those executions to check the process trace when something goes wrong.
Then you should disable the flag jobExecutorDeploymentAware in the platform configuraiton. This flag makes sense when processes require external resources such as classes that are provided by process applications. Then the flag and registrations managed by ManagementService avoid job execution when those resources are not available. See Service Task with Async Continuation Never Executes for a detailed discussion of this.
For the installation pages, a header and small chunk of text would be placed in each of the relevant pages related to shared process engine deployments.
Would likely be good to come up with a keyword focused description of the issue that can be better picked up by a google search.
I changed the developmentAware value to false and restarted the server. At that point about 200 processes that were stuck (processes I launched for testing) went forward, and another 500 got rescheduled for either tonight or a week from now. I guess it is like a restart of the timers? I got some timers with a value of 5 hours and others with a value of 1 week.
The retries count is still at 3 for every job.
Besides the funny behaviour I think it is working properly now. Thanks guys! Really appreciate the help.