New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 888081 link

Starred by 1 user

Issue metadata

Status: Fixed
Owner:
Closed: Sep 21
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: Mac
Pri: 1
Type: Bug

Blocked on:
issue 885337



Sign in to add a comment

Seeing many "Mac lockscreen detected, aborting" errors on Mac AMD GPU.FYI bots

Project Member Reported by ynovikov@chromium.org, Sep 21

Issue description

Affected builds:
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Debug%20%28AMD%29/8304
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Debug%20%28AMD%29/8307
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Debug%20%28AMD%29/8309
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Debug%20%28AMD%29/8310
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Debug%20%28AMD%29/8311

https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Release%20%28AMD%29/7842
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Release%20%28AMD%29/7843
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Release%20%28AMD%29/7844
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Release%20%28AMD%29/7845
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Release%20%28AMD%29/7846
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Retina%20Release%20%28AMD%29/7851

https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Experimental%20Retina%20Release%20%28AMD%29/5222
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Experimental%20Retina%20Release%20%28AMD%29/5223
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Experimental%20Retina%20Release%20%28AMD%29/5224
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Experimental%20Retina%20Release%20%28AMD%29/5226
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Experimental%20Retina%20Release%20%28AMD%29/5227
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20Experimental%20Retina%20Release%20%28AMD%29/5232

https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20GPU%20ASAN%20Release/1955
https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Mac%20FYI%20GPU%20ASAN%20Release/1956

Affected machines:
build388-m4
build390-m4
build396-m4
build398-m4
build401-m4
build435-m4
build442-m4
build705-m4
build706-m4

I took build706-m4 offline since it was mostly red. Other machines seems to have recovered somehow.
 
Well, I guess it's good news that the detection is working? Ken, how do you feel about moving forward with having the Chrome-GPU pool quarantine the affected machines? I can get a patch out in a few minutes.
Let's get the auto-quaranting in place ASAP!

I hope the detection code is accurate. We'll need to monitor the fleet to make sure that we don't accidentally auto-quarantine all the bots.

Project Member

Comment 3 by bugdroid1@chromium.org, Sep 21

The following revision refers to this bug:
  https://chrome-internal.googlesource.com/infradata/config/+/086e43efd1bb7fc5a6e8e5cbf6b010c71def85b2

commit 086e43efd1bb7fc5a6e8e5cbf6b010c71def85b2
Author: Brian Sheedy <bsheedy@google.com>
Date: Fri Sep 21 19:48:34 2018

Are you still seeing this? Any devices on Chrome-GPU should automatically be quarantined now instead of catching this at test run time, although I'm not seeing any that have gotten into that state yet.
No, I didn't see any new failures yet.
We can try bringing build706-m4 back online and see if it will be quarantined.
Sure, that would be a good test.
I've rebooted it, and now it's quarantined!
https://chromium-swarm.appspot.com/bot?id=build706-m4
Status: Fixed (was: Assigned)
Great, that tells us that the quarantining is indeed working, and that we shouldn't have to worry about it quarantining a large portion of the fleet accidentally since that's the only one that's been caught so far.

I'll keep monitoring for a bit longer and keep the other but open in the meantime, but I think that means this one is fixed.
Fantastic! Superb work Brian and thanks Yuly for reporting the problem and helping verify it!

Sign in to add a comment