New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 857177 link

Starred by 1 user

Issue metadata

Status: Available
Owner: ----
Cc:
EstimatedDays: ----
NextAction: 2018-07-09
OS: ----
Pri: 3
Type: Feature

Blocking:
issue 644805



Sign in to add a comment

Undetected poison change passed CQ for Fuchsia x64

Project Member Reported by sheriff-...@appspot.gserviceaccount.com, Jun 27 2018

Issue description

Filed by sheriff-o-matic@appspot.gserviceaccount.com on behalf of petermayo@google.com

content_unittests failing on chromium.linux/Fuchsia x64

Unable to launch content_unittests.

Accused CL: https://chromium-review.googlesource.com/1113946

Passed try run here: https://ci.chromium.org/p/chromium/builders/luci.chromium.try/fuchsia_x64/51652


Builders failed on: 
- Fuchsia x64: 
  https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Fuchsia%20x64
- Fuchsia x64 try:
  https://ci.chromium.org/p/chromium/builders/luci.chromium.try/fuchsia_x64/51735-51939
  and more extending that range (obviously some pass by not running the test, and some fail compile too)

(sorry I don't recall the CQ team label)


 
cherryp-picking it after bb3da314544e64b99dbbfd68042649be3dbd13a1 does not repro the issue


ergo there is an interaction between that CL and https://chromium-review.googlesource.com/c/chromium/src/+/1114850

db409e2 Truncate string attributes in the accessibility tree by James Wallace-Lee · 3 hours ago
b050405 Roll src/third_party/libvpx/source/libvpx/ 8648a64c8..583859d73 (26 commits) by Jerome Jiang · 3 hours ago
9ef7f45 [Display Cutout] Fix bug to send safe areas to current frame by Becca Hughes · 3 hours ago
29f4339 [cr-buildbucket.cfg] Use win7 for Win7 Builder (dbg) Goma Canary by Ryan Tseng · 3 hours ago
69a3377 Make mime handlers inherit the session storage of their embedders. by Sam McNally · 3 hours ago
30184c7 Add a test helper method in services/network to get the body from a ResourceRequest. by John Abd-El-Malek · 3 hours ago
339dfd6 Add missing P check in webview data dir code. by Torne (Richard Coles) · 3 hours ago
ce84d1c Android: Properly track GN .py inputs for actions in internal_rules.gni by Andrew Grieve · 3 hours ago
46c6339 Handle tab drag area bounds correctly for refresh. by Peter Kasting · 3 hours ago
fb64128 Update V8 to version 6.9.234. by v8-ci-autoroll-builder · 3 hours ago
6daf566 [Display Cutout] Refactor safe areas by Becca Hughes · 3 hours ago
a40efea Small cleanups/improvements in and around ProfileSyncServiceStartupTest by Marc Treib · 3 hours ago
b3a4a4b Call DidChangeVisibleSecurityState when interstitial is dismissed. by Eugene But · 3 hours ago
2c670dc [ios] Keyboard shortcuts in tab grid by edchin · 3 hours ago
61c9e43 Add "Remove Users" remote command by Ivan Sandrk · 3 hours ago
827bd42 Disable loading.mobile/G1_3g on Nexus_5 by Oystein Eftevaag · 3 hours ago
1cf00a1 [tcmalloc] Provide low memory footprint build option for tcmalloc. by Jason Roberts · 3 hours ago
b63f00d Fix chrome://kill debug URL to simulate process kills correctly. by Wez · 3 hours ago
9c76065 [ios] Use launch screen for incognito blocker. by Justin Cohen · 3 hours ago
3538353 Revert "Pull GN via CIPD package" by Scott Graham · 3 hours ago
d0ed577 Allow TextureLayer to hold onto software resources over viz restart by danakj · 3 hours ago
92eabd4 cros: move MagnifierType to chrome/ by Qiang Xu · 3 hours ago
fdadc5f [Payments] Open new foreground tab triggered by user in payment handler window in browser by gogerald · 3 hours ago
8343057 Use base::TimeTicks in OffScreenCanvas by Adithya Srinivasan · 3 hours ago
5ac08a7 Make GL state and ContextState consistent by Jonathan Backer · 3 hours ago
0e92e3b Don't show popup sign in window for locked profile or guest/system profile. by Owen Min · 3 hours ago
0895c797 Pull GN via CIPD package by Scott Graham · 3 hours ago
7aeceb4 Fix regression that ComputeInlineBoxPosition no longer enters inline blocks by Xiaocheng Hu · 3 hours ago
09c2c54 SignedExchange: Apply scheme restriction to {cert,validity}-url by Kouhei Ueno · 4 hours ago
34cb384 Fix Chrome OS touchpad pinch DCHECK by Kevin McNee · 4 hours ago
02f5626 Roll src/third_party/perfetto fa2ef190cd68..32dc063f6647 (2 commits) by perfetto-chromium-autoroll · 4 hours ago
9923024 Adding ability to generate sharding maps for a subset of benchmarks by Emily Hanley · 4 hours ago
1a64677 Notify textChanged if button's children change by James Wallace-Lee · 4 hours ago
80be7ab Stop recording "Profile destruction" bucket of Sync.StopSource by Marc Treib · 4 hours ago
Cc: dmazz...@chromium.org jamwalla@chromium.org
 Issue 857176  has been merged into this issue.
Cc: fdegans@chromium.org
+fdegans who was helpful in reproing the failure and testing
Looks like the bot is green again after the revert: https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Fuchsia%20x64

Do you have an idea why the bot was failing? From https://logs.chromium.org/v/?s=chromium%2Fbuildbucket%2Fcr-buildbucket.appspot.com%2F8942539095223463824%2F%2B%2Fsteps%2Fcontent_unittests%2F0%2Fstdout

I can't see how my change could cause this error:
[ERROR:garnet/bin/appmgr/root_loader.cc(71)] Could not load url: file://content_unittests

nor do I, but
  - reverting makes the error go away
  - landing it on db409e2 or later causes it to fail
  - landing it on bb3da314544e64b99dbbfd68042649be3dbd13a1 doesn't

to find out you could do a git bisect, or you could use the chromium infrastructure and git cl try -r <rev> for the revs above (please keep the number of tries reasonable, e.g. do n-ary division)

Good luck
I did some bisecting locally, it looks like landing this CL after https://chromium.googlesource.com/chromium/src/+/1a64677500027ed83a9585040feb8a46438e7c58 is the source of failure.
I'm still not sure why though.
Blocking: 644805
Sheriffs: if this was the result of two conflicting changes landing at roughly the same time, it's not clear what you expect troopers (or cq folks) to do. Please clarify.
re #9:  Ideally we would find interesting ways to detect and avert.

In previous discussions at the start of CQ days the belief was that these problems were vanishingly small. That makes the study of instances of them happening in real life interesting.

When filed, it was unclear why the test would fail by hanging/aborting/failing to launch and whether there was a confounding issue.  That's still a little interesting, but probably after the devs involved find out how the two CLs conflicted.  I don't think there is any trooper input on that now.

This would also be an interesting case for findit to have been able to detect and revert.  Issuing try jobs for empty CLs at the revisions in the failure range would have found the CL to revert.  34 is clearly(?) too many try jobs, but a few layers could have done it quickly.

I would like the people who implement and monitor the test infrastructure to see what happened here more than dictate action or implementation for them.
Labels: -Infra-Troopers Infra-CommitQueue Type-Feature
I'm not sure how or if this could turn into a change.

More details of how the changes conflicted would be useful in figuring out how/if this sort of thing could be detected.
blocked on  issue 858692 , I think?
Labels: -Sheriff-Chromium
Can we close this? From  issue 858692  it looks like the content_unittests failure was fixed; I relanded the original CL.
Re:#14 I hope not.  This issue is about making the infrastructure better able to detect and identify the type of failure.

Perhaps if you have details of how a flaky package manager caused the content_unittests to pass on the CQ run but reliably fail after landing, and then reliably succeed after revert?

i.e. did the flakiness violate a precondition on which the system design is based?
Labels: Pri-3
NextAction: 2018-07-09
Downgrading P2s that haven't been modified in more than 6 months, which also do not have a component or owner.

Sign in to add a comment