TooManyDisconnectedSlaves is firing when masters restart |
|||
Issue descriptionPotential solutions: * balance metric against master uptime (so don't fire it unless the master uptime is X) * have master report all slaves in a third "reconnecting" state immediately after the master comes back online. In addition to letting us fix this alert, it would also enable us to alert on bots which NEVER connect to the master (sometimes when new bots are added to a master they need to be manually restarted in order to connect).
,
Feb 24 2017
Changing priority since this is happening more often. Should be useful for reducing false positives for TooManyDisconnectedSlaves
,
Mar 15 2017
,
Mar 16 2017
|
|||
►
Sign in to add a comment |
|||
Comment 1 by dsansome@chromium.org
, Feb 17 2017