[LUCI-Beta-Bug] fetch_telemetry_binary_dependencies has high failure rate on swarming.
Issue description
Here's an example: https://logs.chromium.org/v/?s=chromium%2Fbuildbucket%2Fcr-buildbucket.appspot.com%2F8963910429643036416%2F%2B%2Fsteps%2Fgclient_runhooks__with_patch_%2F0%2Fstdout . I haven't noticed this occurring as often with the other builders.
,
Nov 8 2017
This is making LUCI unusable: https://ci.chromium.org/p/chromium/builders/luci.chromium.try/linux_chromium_rel_ng?limit=200 has 53 flakes out of 200 right now, while https://build.chromium.org/p/tryserver.chromium.linux/builders/linux_chromium_rel_ng?numbuilds=200 has 0. (I'll have to opt out in the meantime.)
,
Nov 8 2017
,
Nov 8 2017
,
Nov 8 2017
This bug is the same one I hit on a test LUCI CI builder around the same time; see issue 781413. The solution should be to turn on task service accounts for luci.chromium.try. +vadimsh@
,
Nov 8 2017
Yeah, except I don't understand why it is flaky rather than always failing. There's something we don't know about telemetry hooks...
,
Nov 8 2017
+people who may know specifics about why fetch_telemetry_binary_dependencies is failing on some swarming bots.
,
Nov 8 2017
That is a sign that cloud storage authentication is not set up properly. But that script is supposed to be run during "gclient sync" only, so why is it run on a swarming bot?
,
Nov 8 2017
This is a LUCI build. All LUCI builds run on swarming. I am adding a task service account.
,
Nov 8 2017
,
Nov 8 2017
https://chromium.googlesource.com/chromium/src/+/infra/config/cr-buildbucket.cfg was updated to include task service account for all builds on luci.chromium.try. It should help.
,
Nov 8 2017
i meant, it should fix the issue
,
Nov 9 2017
,
Nov 9 2017
There were no runhooks failures since 4:10pm PDT
,
Nov 9 2017
https://ci.chromium.org/p/chromium/builders/luci.chromium.try/linux_chromium_rel_ng?limit=200 has no runhooks failures. Sorry for taking so long to fix this.
,
Nov 9 2017
I'm reopening this as I'm seeing it on try jobs from different patches today:
WARNING:root:Unable to import cv2 due to: /usr/lib/x86_64-linux-gnu/libstdc++.so.6: version `GLIBCXX_3.4.20' not found (required by /b/swarming/w/ir/cache/builder/linux/src/third_party/catapult/telemetry/third_party/cv2/lib/cv2_linux_x86_64_85be3046d2fef651206d7daadbd1b34af2a005f5/cv2.so)
WARNING:root:Unable to import psutil due to: No module named psutil
CRITICAL:devil.utils.cmd_helper:STDERR: unable to initialize libusb: -99
https://logs.chromium.org/v/?s=chromium%2Fbuildbucket%2Fcr-buildbucket.appspot.com%2F8963372731254296368%2F%2B%2Fsteps%2Fgclient_runhooks__with_patch_%2F0%2Fstdout and https://ci.chromium.org/swarming/task/39bb4957dec18d10?server=chromium-swarm.appspot.com
,
Nov 9 2017
This looks similar, but it is a different failure. It looks like fetch_telemetry_binary_dependencies depends on the presence of psutil without declaring the dependency anywhere (see go/vpython). I think this means that gclient sync should run hooks via vpython as opposed to python? https://cs.chromium.org/chromium/tools/depot_tools/gclient.py?q=gclient.py&sq=package:chromium&l=210 +iannucci and +dnj if he has time
,
Nov 9 2017
I've excluded Chromium LUCI dogfooders from https://chrome-infra-auth.appspot.com/auth/groups/luci-chromium-cq-dogfood and created a separate group https://chrome-infra-auth.appspot.com/auth/groups/project-chromium-luci-beta with the end users. To enable again, include project-chromium-luci-beta in luci-chromium-cq-dogfood
,
Nov 9 2017
reverted https://chromium-review.googlesource.com/c/chromium/src/+/760878 and added catapult roller to LUCI dogfood list so it does not break LUCI again
,
Nov 9 2017
,
Nov 9 2017
the builds that are running right now, either before the roll or after revert, pass runhooks. with the revert and adding the roller to dogfooders, this is no longer an emergency, so p1. I will wait for a few green builds before adding dogfooders back though.
,
Nov 9 2017
As expected, CLs that don't include the roll work as intended: https://ci.chromium.org/p/chromium/builders/luci.chromium.try/linux_chromium_rel_ng . The catapult roller is now unable to break LUCI, and https://chromium-review.googlesource.com/c/chromium/src/+/762037 would make its runhooks pass.
,
Nov 10 2017
The following revision refers to this bug:
https://chromium.googlesource.com/chromium/tools/depot_tools/+/0ffcc877a6242e54351557f9a5eb53518c0c381f
commit 0ffcc877a6242e54351557f9a5eb53518c0c381f
Author: Nodir Turakulov <nodir@google.com>
Date: Fri Nov 10 01:53:10 2017
[gclient hooks] add .bat to vpython on windows
Bug: 781430
Change-Id: Idcba016f78078aa9678b8a246e964b3dcb09a016
Reviewed-on: https://chromium-review.googlesource.com/762389
Reviewed-by: Robbie Iannucci <iannucci@chromium.org>
Commit-Queue: Nodir Turakulov <nodir@chromium.org>
[modify] https://crrev.com/0ffcc877a6242e54351557f9a5eb53518c0c381f/gclient.py
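To illustrate what "add .bat to vpython on windows" plausibly means for a hook command, here is a minimal sketch. It is not the code from gclient.py: the function name resolve_hook_command and its platform parameter are hypothetical, and the assumption is that the Windows launcher is a batch file named vpython.bat.

```python
import sys


def resolve_hook_command(cmd, platform=sys.platform):
    """Return a copy of a hook command with its interpreter resolved.

    Hypothetical helper for illustration only; not depot_tools code.
    """
    cmd = list(cmd)  # never mutate the caller's list
    if not cmd:
        return cmd
    if cmd[0] == 'python':
        # Reuse the interpreter that is running this script.
        cmd[0] = sys.executable
    elif cmd[0] == 'vpython' and platform.startswith('win'):
        # Assumption: on Windows the launcher is a batch file, vpython.bat.
        cmd[0] += '.bat'
    return cmd


print(resolve_hook_command(['vpython', 'some_hook.py'], platform='win32'))
print(resolve_hook_command(['vpython', 'some_hook.py'], platform='linux'))
```

The point of the sketch is only that the interpreter name in the hook stays portable ('vpython') and the platform-specific suffix is applied at run time.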
,
Nov 10 2017
The following revision refers to this bug:
https://chromium.googlesource.com/chromium/src.git/+/e2d81c351d09346de3f27cb784e7d2a6049f0c0c
commit e2d81c351d09346de3f27cb784e7d2a6049f0c0c
Author: Nodir Turakulov <nodir@google.com>
Date: Fri Nov 10 07:47:27 2017
[DEPS] run hooks via vpython
Hooks tend to have implicit dependencies
//.vpython covers or should cover all of them, so run all hooks via vpython
Also add missing catapult's dependencies to .vpython.
Bug: 781430
Change-Id: I5ec7a760f44bcb806c654ad8da303f78d3dbee3f
Reviewed-on: https://chromium-review.googlesource.com/762037
Commit-Queue: Nodir Turakulov <nodir@chromium.org>
Reviewed-by: Robbie Iannucci <iannucci@chromium.org>
Cr-Commit-Position: refs/heads/master@{#515499}
[modify] https://crrev.com/e2d81c351d09346de3f27cb784e7d2a6049f0c0c/.vpython
[modify] https://crrev.com/e2d81c351d09346de3f27cb784e7d2a6049f0c0c/DEPS
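For readers unfamiliar with DEPS hooks, the change described in this commit can be pictured roughly as below. This is a sketch under stated assumptions, not the actual diff: DEPS hooks are assumed to be Python-style dicts with 'name', 'pattern', and 'action' keys, the script path shown is assumed from the log paths earlier in this thread, the second hook is made up, and run_hooks_via_vpython is a hypothetical helper.

```python
def run_hooks_via_vpython(hooks):
    """Return copies of the hooks with a leading 'python' replaced by 'vpython'.

    Hypothetical helper for illustration; the real change edited DEPS by hand.
    """
    updated = []
    for hook in hooks:
        action = list(hook.get('action', []))
        if action and action[0] == 'python':
            # vpython is expected to provide the wheels listed in //.vpython.
            action[0] = 'vpython'
        updated.append(dict(hook, action=action))
    return updated


hooks = [
    {
        'name': 'fetch_telemetry_binary_dependencies',
        'pattern': '.',
        # Path assumed for illustration.
        'action': ['python',
                   'src/third_party/catapult/telemetry/bin/'
                   'fetch_telemetry_binary_dependencies'],
    },
    {
        # Made-up hook that does not use Python; it is left untouched.
        'name': 'example_non_python_hook',
        'pattern': '.',
        'action': ['download_tool', '--quiet'],
    },
]

updated = run_hooks_via_vpython(hooks)
for hook in updated:
    print(hook['name'], hook['action'][0])
```

Switching the interpreter only helps if the modules a hook imports (psutil in this thread) are also declared in .vpython, which is why the commit touches both files.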
,
Nov 10 2017
https://chromium-review.googlesource.com/762037 fixed catapult hooks on LUCI; they succeeded on the roller's CL: https://ci.chromium.org/swarming/task/39bf4b50f5d86210?server=chromium-swarm.appspot.com . I've added the project-chromium-luci-beta group back to luci-chromium-cq-dogfood.
,
Jan 31 2018
,
Feb 13 2018
,
Jan 16
,
Jan 16
Comment 1 by estaab@chromium.org, Nov 8 2017
Status: Assigned (was: Untriaged)