Investigate queue backlog #3978
Two bots that have been running their tasks for weeks are stuck here:
I've looked at two instances suffering from this issue. Their stacktraces looked roughly similar:
It looks like Python is stuck in a regex match that never finishes.
Got a stacktrace from a third:
I think a few hundred bots are blocked by this.
Another thing we should have is better task killing. A bot should kill a process that runs a task for too long.
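A minimal sketch of what that could look like, assuming each task runs in its own child process so that the supervisor can still kill it when the task is stuck inside a regex match. The function name `run_task_with_deadline` and its parameters are hypothetical and are not taken from the ClusterFuzz code:

```python
import subprocess
import sys


def run_task_with_deadline(command, timeout_seconds):
    """Run `command` as a child process and kill it if it runs too long.

    Returns a tuple (exit_code, timed_out). Hypothetical helper, for
    illustration only.
    """
    process = subprocess.Popen(command)
    try:
        exit_code = process.wait(timeout=timeout_seconds)
        return exit_code, False
    except subprocess.TimeoutExpired:
        # The task exceeded its deadline: kill it and reap the child so it
        # does not linger as a zombie.
        process.kill()
        process.wait()
        return process.returncode, True


if __name__ == "__main__":
    # A task that sleeps far longer than its deadline gets killed.
    slow_task = [sys.executable, "-c", "import time; time.sleep(30)"]
    print(run_task_with_deadline(slow_task, timeout_seconds=1))
```

Running the task in a separate process matters here: a thread stuck in a long regex match inside the bot's own interpreter cannot be reliably interrupted from another thread, while a child process can always be killed.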
Another lesson we can learn is to have alerts when queue backlogs get too long. A long backlog is both a symptom of other problems and an issue in and of itself.
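As a rough sketch of such an alert check, assuming some monitoring job can already obtain the unacked message count and the age of the oldest unacked message for a queue. The `QueueStats` type, the `backlog_alerts` function, and both thresholds are hypothetical placeholders, not existing ClusterFuzz settings:

```python
from dataclasses import dataclass


@dataclass
class QueueStats:
    """Snapshot of one task queue (hypothetical type, for illustration)."""
    name: str
    unacked_messages: int
    oldest_unacked_age_seconds: float


def backlog_alerts(stats, max_unacked=10_000, max_age_seconds=6 * 3600):
    """Return a list of alert strings; an empty list means the queue is fine.

    Both thresholds are assumed values chosen only for this example.
    """
    alerts = []
    if stats.unacked_messages > max_unacked:
        alerts.append(
            f"{stats.name}: {stats.unacked_messages} unacked messages "
            f"(limit {max_unacked})"
        )
    if stats.oldest_unacked_age_seconds > max_age_seconds:
        alerts.append(
            f"{stats.name}: oldest unacked message is "
            f"{stats.oldest_unacked_age_seconds:.0f}s old "
            f"(limit {max_age_seconds}s)"
        )
    return alerts


if __name__ == "__main__":
    # A backlog of the size reported in this issue would trip the count check.
    for alert in backlog_alerts(QueueStats("jobs", 100_000, 60.0)):
        print(alert)
```

Checking message age as well as count would also catch the case where a small number of tasks are taken but never finish.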
Enormous stacktraces containing a giant array on a single line are causing bots to freeze. Although fuzztest really should not be printing an input this large, let's try to be resilient when it misbehaves. Fixes: #3978
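One possible way to get this kind of resilience, shown only as a sketch and not necessarily what the fix above does, is to cap the length of every line before any regex-based stacktrace parsing sees it. The function name `truncate_long_lines`, the limit, and the marker text are all assumptions made for this example:

```python
MAX_LINE_LENGTH = 10_000  # Assumed limit; tune to the parser's needs.


def truncate_long_lines(text, max_length=MAX_LINE_LENGTH,
                        marker=" ...[truncated]"):
    """Cut every line of `text` that is longer than `max_length` characters.

    Lines within the limit are returned unchanged, so ordinary stack frames
    are unaffected. Hypothetical helper, for illustration only.
    """
    truncated = []
    for line in text.split("\n"):
        if len(line) > max_length:
            line = line[:max_length] + marker
        truncated.append(line)
    return "\n".join(truncated)


if __name__ == "__main__":
    giant_array_line = "{" + "0x41, " * 1_000_000 + "}"
    output = "ERROR: crash in target\n" + giant_array_line
    safe_output = truncate_long_lines(output)
    # The longest remaining line is bounded by the limit plus the marker.
    print(max(len(line) for line in safe_output.split("\n")))
```

Bounding the line length bounds the input that any single regex match has to scan, which limits the damage even when a pattern backtracks badly.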
This has been mitigated in ClusterFuzz and fuzztest no longer does this.
But we should still deal with backlogs better.
There is a 100k unacked message backlog.
Interestingly, many of these tasks appear taken but not finished.