Workflows stuck in delayed queue for hours/days and never get executed #6230
Comments
@arm4b Any solution here?
Can you show what state your workflows are in? I asked the same question on Slack today. This has been troubling me for a long time and I am still trying to solve it.
My older workflows never make it to execution. If one somehow does, it keeps holding at one task or another for hours and only completes after 2 or 3 hours.
Also, I am getting this error as well. root@stackstorm:~# st2 execution list -l --status delayed -n 2000 2>/dev/null
This is a known issue. If the RabbitMQ connection retries are exhausted, an action stays stuck in running forever. Your box is likely experiencing some internal network issues. Do your workflows create very large contexts or have very large inputs or outputs?
Thanks @guzzijones for replying. But the main concern is that ST2 keeps them in the queue for days and never even executes them.
Can you see what state most workflow instances are in?
They are all stuck in delayed state. More than 5000 workflows.
What is in your st2-workflow-engine logs and st2-action-runner logs? I bet you see disconnects from RabbitMQ.
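One rough way to check the logs for the disconnects suggested above is to filter for lines that look like broker connection problems. The sketch below is a hypothetical helper, not part of ST2: the keyword patterns are guesses at what a disconnect might look like and are not taken from actual log output in this thread, so adjust them to match what your logs contain.

```python
import re

# Hypothetical patterns (an assumption, not confirmed st2 log strings).
DISCONNECT_PATTERNS = re.compile(
    r"connection (reset|refused|closed|lost)|broken pipe|ConnectionError",
    re.IGNORECASE,
)


def find_disconnects(lines):
    """Return the log lines that look like broker connection problems."""
    return [
        line.rstrip("\n")
        for line in lines
        if DISCONNECT_PATTERNS.search(line)
    ]
```

Passing an open log file to `find_disconnects` returns the suspicious lines; comparing their timestamps with the time the executions became delayed may show whether the two are related. The location of the log files depends on your installation.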
@guzzijones This is a big performance issue.
Hi there,
Could anyone help me with an issue? We use ST2 extensively, and many workflows run on the box. The box is well provisioned in terms of memory and CPU.
We have noticed that workflows get queued and never get executed, and the delayed queue is always far too long whenever we check it.
For example:
If I check the delayed queue by running the following command (st2 execution list -l --status delayed), I can see workflows from 2 days ago that never made it to execution. Because of this, other workflows are also affected: a simple workflow that generally takes 10 minutes takes 50 minutes to finish.
Can anybody help me here?
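To put numbers on how long executions have been sitting in the delayed queue, the listing can be summarized by age. The sketch below is a minimal, hypothetical helper. It assumes the CLI can emit the listing as a JSON array (for example via a `-j` flag, which should be confirmed with `st2 execution list --help`) and that each entry carries an ISO 8601 `start_timestamp` field. Both are assumptions about the output format, not something shown in this thread.

```python
import json
from datetime import datetime, timezone


def delayed_ages_hours(raw_json, now=None):
    """Return the ages in hours (oldest first) of the listed executions."""
    now = now or datetime.now(timezone.utc)
    ages = []
    for execution in json.loads(raw_json):
        # "start_timestamp" is an assumed field name; skip entries without it.
        stamp = execution.get("start_timestamp")
        if not stamp:
            continue
        started = datetime.fromisoformat(stamp.replace("Z", "+00:00"))
        if started.tzinfo is None:
            # Assume UTC when the timestamp carries no timezone.
            started = started.replace(tzinfo=timezone.utc)
        ages.append((now - started).total_seconds() / 3600)
    return sorted(ages, reverse=True)
```

Calling `delayed_ages_hours` on the saved JSON output gives the age of each delayed execution, so the first element is the oldest one still waiting.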