Implement cron job to detect hung analyses. |
||||
Issue descriptionWe should also trigger an alert if analyses fail to make progress after 24 hours, (due to resource starvation, continuous loop in logic, external failures, etc.) To achive this we could add an hourly cron job that checks all analyses in progress and records the culprit range for all jobs. If this hasn't changed in 24 hours, then we can consider the analysis as hung.
,
Oct 2 2017
If the purpose is to auto-rerun the analyses, it might be a different story. But we are not there yet.
,
Jan 5 2018
,
Apr 26 2018
We have a different approach for this. |
||||
►
Sign in to add a comment |
||||
Comment 1 by st...@chromium.org
, Oct 2 2017