Switch priority on FYI swarming bots
Issue description
https://build.chromium.org/p/chromium.fyi/builders/Mojo%20Linux?numbuilds=200 is our FYI bot for the network service. During the daytime, we often have long build cycles because our FYI swarming priority is low; for example, https://build.chromium.org/p/chromium.fyi/builders/Mojo%20Linux/builds/7439 has 35+ minutes of pending time for its jobs. We need this bot to cycle fast, as it helps us track down when regressions occur. We're not ready to move it out of FYI yet, but in the meantime, is it possible to set the priority of swarming tasks for this bot to be similar to that of the main waterfall? I couldn't find a way. Dirk: please triage, thanks.
Nov 20 2017
What if the CQ and waterfall are not using all the capacity on their own, but combined with the FYI bots they are? We might not care about most FYI bots getting slowed down a bit, but in our bot's case, we do. So specifying a swarming capacity just for that bot seems lighter weight than creating a new master. I'm not really enthusiastic about using LUCI for our bot at this point.
Nov 20 2017
We shouldn't be using all of our capacity even for "CQ + Waterfall + FYI", which is why getting more capacity is the right answer and my top priority. I understand your reluctance about setting up a new master and/or moving to LUCI. Changing the priority is certainly lighter weight, but it has the downside that different FYI bots would get different QoS (which I understand is what you want). That is a change from how we actually do things today, and something I don't particularly want to encourage. Hopefully my plan to get more capacity in the next day or two is enough of a short-term answer. In a few weeks, we will be moving bots to LUCI in earnest, and at that point we can look into better longer-term answers.
Nov 21 2017
OK, thanks for the explanations. I wonder if removing most of the old navigation tests would give back enough capacity? I can't find a way to see the capacity usage of the swarming bots (the status link is broken).
Nov 21 2017
The graph I use for monitoring the capacity of the Linux swarming pool is http://shortn/_dXaTT4clmM . Yesterday we were clearly maxing things out. I'll keep an eye on it today.
Dec 15 2017
Marking this as WontFix since we've added capacity and AFAIK don't have any current issues, and since I don't want to actually change the priority, as per the discussion above.
Comment 1 by dpranke@chromium.org, Nov 20 2017