New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 820119 link

Starred by 1 user

Issue metadata

Status: Fixed
Owner:
Closed: Mar 2018
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 2
Type: Bug



Sign in to add a comment

cros-beefy388-c2 stuck in CleanBuildRoot for hours

Project Member Reported by pprabhu@chromium.org, Mar 8 2018

Issue description

config: reef-release
Build: https://uberchromegw.corp.google.com/i/chromeos/builders/reef-release/builds/2003

Has been stuck in CleanBuildRoot for ~15 hours.

Last relevant log line is: 

18:14:48: INFO: Checkout cleanup failed, wiping buildroot:
Traceback (most recent call last):
  File "/b/rr/tmpoCuOYR/w/chromite/scripts/cbuildbot_launch.py", line 199, in CleanBuildRoot
    repo.BuildRootGitCleanup(prune_all=True)
  File "/b/rr/tmpoCuOYR/w/chromite/cbuildbot/repository.py", line 412, in BuildRootGitCleanup
    git.ManifestCheckout.Cached(self.directory).ListCheckouts()]
  File "/b/rr/tmpoCuOYR/w/chromite/lib/git.py", line 771, in Cached
    search=search)
  File "/b/rr/tmpoCuOYR/w/chromite/lib/git.py", line 595, in _NormalizeArgs
    raise OSError(errno.ENOENT, "Couldn't find repo root: %s" % (path,))
OSError: [Errno 2] Couldn't find repo root: /b/c/cbuild/repository
18:14:48: INFO: RunCommand: sudo -- find /b/c/cbuild/repository -mindepth 1 -maxdepth 1 -exec rm -rf '{}' +
 
Cc: nxia@chromium.org dgarr...@chromium.org
I killed the build for a waterfall restart. But need to keep an eye on the next build on this slave + figure out what's really up.
Status: Unconfirmed (was: Untriaged)

Comment 3 by nxia@chromium.org, Mar 8 2018

still stuck. I forced a clobber restart.
It's a GCE instance (cros-beefy388-c2). I generally suggest reinstancing them when something like this happens, so you get back to a 100% known state.
Owner: dgarr...@chromium.org
Status: Started (was: Unconfirmed)
This is still happening.

That's expected. A clobber restart only affects cbuildbot's behavior, but this problem is happening during cbuildbot_launch.

I'm investigating the state of the builder so we can try to harden cbuildbot_launch, then I'll reinstance it.
I'm unable to ssh in to investigate. Reinstanced.

This should fix the problem, but doesn't explain it. Looks like some kind of general file system corruption?
Summary: ros-beefy388-c2 stuck in CleanBuildRoot for hours (was: Builder stuck in CleanBuildRoot for hours)
Summary: cros-beefy388-c2 stuck in CleanBuildRoot for hours (was: ros-beefy388-c2 stuck in CleanBuildRoot for hours)
Believed fixed, but waiting for the current build to get far enough along to confirm.

https://uberchromegw.corp.google.com/i/chromeos/builders/reef-release/builds/2006
Status: Fixed (was: Started)
2006 is on SetupBoard

Sign in to add a comment