New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 863539 link

Starred by 4 users

Issue metadata

Status: Duplicate
Merged: issue 865214
Owner:
Closed: Jul 18
Cc:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 3
Type: Bug



Sign in to add a comment

Reboot during StartAndroid.stress test

Project Member Reported by jettrink@chromium.org, Jul 13

Issue description

There have been 2 somewhat recent CQ builder failures due to the DUT restarting in the middle of StartAndroid.stress. I couldn't really find anything that helpful in the logs.

https://luci-milo.appspot.com/buildbot/chromeos/veyron_mighty-paladin/9285
https://luci-milo.appspot.com/buildbot/chromeos/elm-paladin/6733
 
It's also not clear to me what is causing this, the system reboots orderly, not due to a kernel crash.

From a Moma search on 'cheets_StartAndroid "DUT rebooted during the test run"' it seems the issue has been around for > 6 months.

e.g. this is an instance on auron_paine from 02/22/18:

https://stainless.corp.google.com/browse/chromeos-autotest-results/178827513-chromeos-test/
I'm bit confused about the failure on veyron_mighty:

Supposedly the test ran on chromeos4-row6-rack11-host21. I ssh-ed into that device hoping to find some additional info in the local logs, however /var/log/messages doesn't match with the info on stainless (https://stainless.corp.google.com/browse/chromeos-autotest-results/216901379-chromeos-test/). It also reports an uptime > 13h, while the failure was reported about 3h ago (10:52:27):

uptime
 13:48:51 up  3:11,  0 users,  load average: 1.02, 0.86, 0.86

Am I just embarrassing myself looking at the wrong device or is there something weird here?
logs.tar.gz
270 KB Download
dmesg.log
57.3 KB View Download
I'm at least in part embarrassing myself, the uptime actually says 3:11 hours ... Will do another attempt to match the logs.
The first part of /var/log/messages actually matches, I got confused trying to match the kernel messages in /sys/fs/pstore/console-ramoops with /var/log/messages, which indeed don't match. Looks like a new log file was started after the mysterious reboot.
Cc: rrangel@chromium.org pmalani@chromium.org
Copying current sheriffs
Owner: domlasko...@chromium.org
Status: Assigned (was: Untriaged)
Kicking to current arc constable, though at this point this might be obsolete / wontfixable. This is unlikely to be an infra issue.
Mergedinto: 865214
Status: Duplicate (was: Assigned)

Sign in to add a comment