New issue
Advanced search Search tips

Issue 864722 link

Starred by 1 user

Issue metadata

Status: Assigned
Owner:
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 2
Type: Feature

Blocking:
issue 408424
issue 443739
issue 864728
issue 622538
issue 645323
issue 729851
issue 752244



Sign in to add a comment

Swarming: define task bucketing in pools.cfg

Project Member Reported by mar...@chromium.org, Jul 17

Issue description

For task centric monitoring:

Update pools.cfg to define the tags (which can also be dimensions) that we want to group the tasks. This enables evaluating pending time, actual throughput, etc.
https://cs.chromium.org/chromium/infra/luci/appengine/swarming/proto/pools.proto

This shall also define a pubsub topic where the time series can be streamed to at a one minute resolution (more discussion is needed about this). The default is streaming through ts_mon. 

This is only the luci-config part, not the implementation.
 
Blocking: 864728
 Issue 866052  will act as a defacto reason to add pools.cfg everywhere. These configurations will live in pools.cfg.
Project Member

Comment 3 by bugdroid1@chromium.org, Dec 7

The following revision refers to this bug:
  https://chromium.googlesource.com/infra/luci/luci-py.git/+/bd9ae51d3d87f6949fadd7165493a65d221d8cc1

commit bd9ae51d3d87f6949fadd7165493a65d221d8cc1
Author: Marc-Antoine Ruel <maruel@chromium.org>
Date: Fri Dec 07 12:59:01 2018

[swarming] Add cron job for both bot and task monitoring

They don't do anything, it is simply to ease the deployment by adding
the handlers right away so I can check in cron.yaml update soonish.

Start adding bot events in memcache; this will accelerate monitoring by
an order of magnitude.

Bug: 864722
Bug: 864724
Change-Id: I5702dccb6ee341414db6ebb69e1a207e748f5d29
Reviewed-on: https://chromium-review.googlesource.com/c/1366635
Commit-Queue: Marc-Antoine Ruel <maruel@chromium.org>
Reviewed-by: Quinten Yearsley <qyearsley@chromium.org>

[add] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/doc/Monitoring.md
[modify] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/handlers_backend.py
[modify] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/server/bot_management.py
[modify] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/server/bot_management_test.py
[add] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/server/stats_bots.py
[add] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/server/stats_bots_test.py
[add] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/server/stats_tasks.py
[add] https://crrev.com/bd9ae51d3d87f6949fadd7165493a65d221d8cc1/appengine/swarming/server/stats_tasks_test.py

Sign in to add a comment