Swarming: sweep pending tasks to mark as NO_RESOURCE when count of bot which could run the task goes to 0. |
|
Issue descriptionEven with soft device affinity, sometimes we trigger a task and a bot die after. That leads to tasks pending for 6, 7hours. Would be great if the tasks with 0 available bots are ended early. Example: https://chrome-swarming.appspot.com/task?id=3eafc5e5476c0610&refresh=10&show_raw=1 *I am guessing this problem is specific to perf since we are the only one who try to trigger task with specific bot id?
,
Jul 16
Thanks Marc for pointing out the solution. Can someone from infra team help with implementing this? This will help a lot with speeding up the perf waterfall cycle time
,
Jul 16
I guess #2 could be implemented first (since it's what is occurring in your case), it shouldn't be too hard since it's happening in a cron job already. As for when it's going to be implemented, I can't give you a definite time table yet. |
|
►
Sign in to add a comment |
|
Comment 1 by mar...@chromium.org
, Jul 16Summary: Swarming: sweep pending tasks to mark as NO_RESOURCE when count of bot which could run the task goes to 0. (was: Stop swarming jobs that have no available bots early)