The current bot states are: idle, busy, dead, maintenance, quarantined.
Problems:
- The states do not represent 100% of the bot time.
- When an host is rebooting, it's currently listed as idle.
- While a bot cleans up its cache, it's currently listed as idle. This used to be mostly done within task's scope but this changed with issue 868083 .
- Bot hooks are reported independently from the bot itself. This means it cannot be represented on the server as a bot state. This is tracked as issue 835274.
AI:
- Add new states:
- "overhead" for internal cleanup; this could be subclassified as:
- hooks (with the hook name)
- "cache_cleanup" for isolated and named cache cleanup.
- "rebooting" when we know the host is rebooting due to the bot's action.
http://go/swarming-monitoring-v2#heading=h.tn6n7ysq9dtk
Comment 1 by mar...@chromium.org
, Aug 3