For bot centric monitoring:
Update pools.cfg to define the dimensions that we want to group the bots to define utilization levels, health, time spent in maintenance/quarantined mode, etc.
https://cs.chromium.org/chromium/infra/luci/appengine/swarming/proto/pools.proto
This shall also define a pubsub topic where the time series can be streamed to at a one minute resolution (more discussion is needed about this). The default is streaming through ts_mon.
This is only the luci-config part, not the implementation.
Comment 1 by mar...@chromium.org
, Jul 17