samus: Host did not return from reboot |
||
Issue descriptionsamus-nyc-android-pfq failed: https://luci-milo.appspot.com/buildbot/chromeos/samus-nyc-android-pfq/1429 There's no clear reason from the test logs (the error logs are both blank for both the original run and the re-run), but the build failure does say this: cheets_ClobberStateful: ABORT: Host did not return from reboot Need to check in DUT history to see what went wrong when reboot was attempted.
,
Jan 13 2018
Also, stack of ClobberStateful failure:
Traceback (most recent call last):
File "/usr/local/autotest/server/server_job.py", line 1033, in run_op
op_func()
File "/usr/local/autotest/server/hosts/remote.py", line 160, in reboot
**dargs)
File "/usr/local/autotest/server/hosts/remote.py", line 229, in wait_for_restart
self.log_op(self.OP_REBOOT, op_func)
File "/usr/local/autotest/client/common_lib/hosts/base_classes.py", line 566, in log_op
op_func()
File "/usr/local/autotest/server/hosts/remote.py", line 228, in op_func
super(RemoteHost, self).wait_for_restart(timeout=timeout, **dargs)
File "/usr/local/autotest/client/common_lib/hosts/base_classes.py", line 310, in wait_for_restart
raise error.AutoservRebootError("Host did not return from reboot")
AutoservRebootError: Host did not return from reboot
FAIL cheets_ClobberStateful cheets_ClobberStateful timestamp=1515769112 localtime=Jan 12 06:58:32 Unhandled AutoservRebootError: Host did not return from reboot
Traceback (most recent call last):
File "/usr/local/autotest/client/common_lib/test.py", line 831, in _call_test_function
return func(*args, **dargs)
File "/usr/local/autotest/client/common_lib/test.py", line 495, in execute
dargs)
File "/usr/local/autotest/client/common_lib/test.py", line 362, in _call_run_once_with_retry
postprocess_profiled_run, args, dargs)
File "/usr/local/autotest/client/common_lib/test.py", line 400, in _call_run_once
self.run_once(*args, **dargs)
File "/usr/local/autotest/server/site_tests/cheets_ClobberStateful/cheets_ClobberStateful.py", line 36, in run_once
self.client.reboot()
File "/usr/local/autotest/server/hosts/cros_host.py", line 1520, in reboot
super(CrosHost, self).reboot(**dargs)
File "/usr/local/autotest/server/hosts/remote.py", line 164, in reboot
self.log_op(self.OP_REBOOT, reboot)
File "/usr/local/autotest/client/common_lib/hosts/base_classes.py", line 562, in log_op
self.job.run_op(op, op_func, self.get_kernel_ver)
File "/usr/local/autotest/server/server_job.py", line 1033, in run_op
op_func()
File "/usr/local/autotest/server/hosts/remote.py", line 160, in reboot
**dargs)
File "/usr/local/autotest/server/hosts/remote.py", line 229, in wait_for_restart
self.log_op(self.OP_REBOOT, op_func)
File "/usr/local/autotest/client/common_lib/hosts/base_classes.py", line 566, in log_op
op_func()
File "/usr/local/autotest/server/hosts/remote.py", line 228, in op_func
super(RemoteHost, self).wait_for_restart(timeout=timeout, **dargs)
File "/usr/local/autotest/client/common_lib/hosts/base_classes.py", line 310, in wait_for_restart
raise error.AutoservRebootError("Host did not return from reboot")
AutoservRebootError: Host did not return from reboot
END FAIL cheets_ClobberStateful cheets_ClobberStateful timestamp=1515769112 localtime=Jan 12 06:58:32
,
Jan 13 2018
Found another similar case for platform_PowerWash test. https://luci-milo.appspot.com/buildbot/chromeos/samus-release/5017 Looking at DUT history, I see the following reset + repair combo: https://pantheon.corp.google.com/storage/browser/chromeos-autotest-results/hosts/chromeos2-row3-rack10-host13/1694699-reset/20181101211402/ https://pantheon.corp.google.com/storage/browser/chromeos-autotest-results/hosts/chromeos2-row3-rack10-host13/1694717-repair/20181101211629/ And the repair also had to do some powercycling: START ---- repair timestamp=1515734197 localtime=Jan 11 21:16:37 GOOD ---- verify.servo_ssh timestamp=1515734200 localtime=Jan 11 21:16:40 GOOD ---- verify.brd_config timestamp=1515734202 localtime=Jan 11 21:16:42 GOOD ---- verify.ser_config timestamp=1515734202 localtime=Jan 11 21:16:42 GOOD ---- verify.job timestamp=1515734204 localtime=Jan 11 21:16:44 GOOD ---- verify.servod timestamp=1515734209 localtime=Jan 11 21:16:49 GOOD ---- verify.pwr_button timestamp=1515734209 localtime=Jan 11 21:16:49 GOOD ---- verify.lid_open timestamp=1515734209 localtime=Jan 11 21:16:49 GOOD ---- verify.update timestamp=1515734223 localtime=Jan 11 21:17:03 GOOD ---- verify.PASS timestamp=1515734223 localtime=Jan 11 21:17:03 FAIL ---- verify.ssh timestamp=1515734812 localtime=Jan 11 21:26:52 No answer to ping from chromeos2-row3-rack10-host13 START ---- repair.rpm timestamp=1515734812 localtime=Jan 11 21:26:52 FAIL ---- repair.rpm timestamp=1515735056 localtime=Jan 11 21:30:56 chromeos2-row3-rack10-host13 is still offline after powercycling END FAIL ---- repair.rpm timestamp=1515735056 localtime=Jan 11 21:30:56 START ---- repair.sysrq timestamp=1515735056 localtime=Jan 11 21:30:56 FAIL ---- repair.sysrq timestamp=1515735352 localtime=Jan 11 21:35:52 Host chromeos2-row3-rack10-host13 is still offline after sysrq. END FAIL ---- repair.sysrq timestamp=1515735352 localtime=Jan 11 21:35:52 START ---- repair.servoreset timestamp=1515735353 localtime=Jan 11 21:35:53 GOOD ---- verify.ssh timestamp=1515735377 localtime=Jan 11 21:36:17 END GOOD ---- repair.servoreset timestamp=1515735377 localtime=Jan 11 21:36:17 GOOD ---- verify.fwstatus timestamp=1515735377 localtime=Jan 11 21:36:17 GOOD ---- verify.good_au timestamp=1515735378 localtime=Jan 11 21:36:18 GOOD ---- verify.devmode timestamp=1515735379 localtime=Jan 11 21:36:19 GOOD ---- verify.writable timestamp=1515735380 localtime=Jan 11 21:36:20 GOOD ---- verify.tpm timestamp=1515735381 localtime=Jan 11 21:36:21 GOOD ---- verify.ext4 timestamp=1515735382 localtime=Jan 11 21:36:22 GOOD ---- verify.power timestamp=1515735383 localtime=Jan 11 21:36:23 GOOD ---- verify.rwfw timestamp=1515735384 localtime=Jan 11 21:36:24 FAIL ---- verify.python timestamp=1515735385 localtime=Jan 11 21:36:25 Python is missing; may be caused by powerwash GOOD ---- verify.cros timestamp=1515735395 localtime=Jan 11 21:36:35 START ---- repair.au timestamp=1515735395 localtime=Jan 11 21:36:35 GOOD ---- verify.ssh timestamp=1515735752 localtime=Jan 11 21:42:32 GOOD ---- verify.power timestamp=1515735756 localtime=Jan 11 21:42:36 GOOD ---- verify.rwfw timestamp=1515735757 localtime=Jan 11 21:42:37 GOOD ---- verify.python timestamp=1515735758 localtime=Jan 11 21:42:38 GOOD ---- verify.cros timestamp=1515735767 localtime=Jan 11 21:42:47 END GOOD ---- repair.au timestamp=1515735767 localtime=Jan 11 21:42:47 GOOD ---- verify.hwid timestamp=1515735769 localtime=Jan 11 21:42:49 GOOD ---- verify.PASS timestamp=1515735769 localtime=Jan 11 21:42:49 START ---- reboot timestamp=1515735769 localtime=Jan 11 21:42:49 GOOD ---- reboot.start timestamp=1515735769 localtime=Jan 11 21:42:49 GOOD ---- reboot.verify timestamp=1515735790 localtime=Jan 11 21:43:10 END GOOD ---- reboot kernel=3.14.0 localtime=Jan 11 21:43:15 timestamp=1515735795 INFO ---- repair timestamp=1515735795 localtime=Jan 11 21:43:15 Can't repair label 'pool:bvt'. INFO ---- repair timestamp=1515735795 localtime=Jan 11 21:43:15 Can't repair label 'board:samus'. INFO ---- repair timestamp=1515735795 localtime=Jan 11 21:43:15 Can't repair label 'cros-version:samus-release/R65-10301.0.0'. END GOOD ---- repair timestamp=1515735795 localtime=Jan 11 21:43:15 chromeos2-row3-rack10-host13 repaired successfully |
||
►
Sign in to add a comment |
||
Comment 1 by linben@chromium.org
, Jan 13 2018