Identify dedidcated vs. pooled buildslaves on PFQ build slaves |
||||||
Issue descriptionWhen a PFQ slave builder is starting late, it is likely because its primary buildslave is offline. It would be helpful for debugging this if we identified: * Whether a buildslave is dedicated or part of a shared pool * When a buildslave is busy When PFQ builds start late they are almost guaranteed to time out since there is a relatively narrow margin for slow builds. See issue 648436 for some background.
,
Sep 21 2016
Yes, that's what I just did, but it requires some knowledge of how the slaves are allocated (I added a YAQS entry but most gardeners are not going to remember that), and they are only going to do that when investigating a problem. This could potentially give us a heads up in advance, or assist in identifying the problem sooner (without pestering the infra team). Consider this a suggestion, feel free to ignore.
,
Sep 21 2016
This is specialized for CrOS and would require some BuildBot code plumbing to enable. Since the current UI is sufficient and this sort of thing should be automated anyway, I don't think investing in augmenting the UI with this information is very valuable.
,
Sep 27 2016
Marking as fixit to automate the flow.
,
Sep 28 2017
This issue has been Available for over a year. If it's no longer important or seems unlikely to be fixed, please consider closing it out. If it is important, please re-triage the issue. Sorry for the inconvenience if the bug really should have been left as Available. If you change it back, also remove the "Hotlist-Recharge-Cold" label. For more details visit https://www.chromium.org/issue-tracking/autotriage - Your friendly Sheriffbot
,
May 17 2018
,
May 29 2018
This will be completely resolved by Swarming as we'll no longer be tethered to a single machine. This failure mode won't happen. Timeline: 2-3 months from now, at the latest. |
||||||
►
Sign in to add a comment |
||||||
Comment 1 by d...@chromium.org
, Sep 21 2016