Dump the list of tests that are running when we are close to overall shard timeout |
|||||
Issue descriptionCurrently, when shard is timing out, e.g. see https://chromium-swarm.appspot.com/task?id=3e9820d1959edc10, we can't tell which tests are taking too long. It would nice if we could dump the list of tests that are running when that happens. One way to implement this is to capture SIGTERM from swarming. Alternatively, we can have our own watchdog thread in test runner that prints the list of running tests when we are close to 45 minutes.
,
Jul 11
,
Jul 11
Michael, do you have a draft CL for your failed attempt?
,
Jul 12
I think I only tried locally. But on Mac, on failed death-tests we can see in some builds that the text we usually print on caught SIGTERM is not printed. So we couldn't print anything else either...
,
Jul 12
The it seems we need to go with the watchdog thread approach that will have a way to communicate with the worker thread to get the list of currently running tests. One disadvantage with this is that we'll like have to duplicate the information about the timeout in cr-buildbucket.cfg and in logic for this watchdog thread, unless we find a way to request the current timeout from parent swarming process on the bot somehow or by reading cr-buildbucket.cfg via Gitiles APIs.
,
Oct 11
,
Oct 19
|
|||||
►
Sign in to add a comment |
|||||
Comment 1 by machenb...@chromium.org
, Jul 11