New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 823799 link

Starred by 1 user

Issue metadata

Status: Fixed
Owner:
Last visit > 30 days ago
Closed: Mar 2018
Cc:
EstimatedDays: ----
NextAction: ----
OS: Chrome
Pri: 3
Type: Bug



Sign in to add a comment

Unexplained string of timeouts in CtsAccelerationTestCases

Project Member Reported by sha...@chromium.org, Mar 20 2018

Issue description

Eg:

https://luci-milo.appspot.com/buildbot/chromeos/auron_paine-release/2031
https://luci-milo.appspot.com/buildbot/chromeos/coral-release/848
https://luci-milo.appspot.com/buildbot/chromeos/fizz-release/1093
https://luci-milo.appspot.com/buildbot/chromeos/nautilus-release/470

I'm not sure if this info is meaningful:

  Suite job                                 [ FAILED ]
  Suite job                                   ABORT: 
...
01:18:34: ERROR: wait_cmd has lab failures: cmd=['/b/c/cbuild/repository/chromite/third_party/swarming.client/swarming.py', 'run', '--swarming', 'chromeos-proxy.appspot.com', '--task-summary-json', '/tmp/cbuildbot-tmpphVNDr/tmpH7Tsml/temp_summary.json', '--raw-cmd', '--task-name', u'nautilus-release/R67-10503.0.0-bvt-arc', '--dimension', 'os', 'Ubuntu-14.04', '--dimension', 'pool', 'default', '--print-status-updates', '--timeout', '14400', '--io-timeout', '14400', '--hard-timeout', '14400', '--expiration', '1200', u'--tags=priority:Build', u'--tags=suite:bvt-arc', u'--tags=build:nautilus-release/R67-10503.0.0', u'--tags=task_name:nautilus-release/R67-10503.0.0-bvt-arc', u'--tags=board:nautilus', '--', '/usr/local/autotest/site_utils/run_suite.py', '--build', u'nautilus-release/R67-10503.0.0', '--board', u'nautilus', '--suite_name', u'bvt-arc', '--pool', u'bvt', '--file_bugs', 'True', '--priority', 'Build', '--timeout_mins', '180', '--retry', 'True', '--max_retries', '5', '--minimum_duts', '4', '--suite_min_duts', '6', '--offload_failures_only', 'False', '--job_keyvals', "{'cidb_build_stage_id': 73905518L, 'cidb_build_id': 2396732, 'datastore_parent_key': ('Build', 2396732, 'BuildStage', 73905518L)}", '-m', '184953758'].

 

Comment 1 by sha...@chromium.org, Mar 20 2018

The builders mostly cycled green, so this isn't urgent, but it would be nice to know what happened.
Cc: kroot@chromium.org domlasko...@chromium.org
Components: Platform>ARC
Owner: domlasko...@chromium.org
Status: Assigned (was: Untriaged)
>   Suite job                                 [ FAILED ]
>   Suite job                                   ABORT: 

This is the proximate cause of the failure.  It means that the suite
job timed out and aborted.  Normally, the logs that follow should
show which individual tests were affected.

The output from run_suite shows only PASSED test results, but that's a
lie; if you look at the actual suite jobs (not the output from run_suite)
you find that in every case, cheets_CTS_N.7.1_r15.x86.CtsAccelerationTestCases
aborted.  That's likely the cause of the overall suite timeout.

So, the root cause of the problem is a failure in the CTS acceleration
test.  Secondarily, there's a bug that our logs didn't actually tell us so.

Given that the problem went away quickly, the first question to ask would
be "did anyone find and revert a change causing that sort of failure?"

For reference, the auron_paine suite job that aborted:
    http://cautotest-prod/afe/#tab_id=view_job&object_id=184954122

And the job for the failed acceleration test:
    http://cautotest-prod/afe/#tab_id=view_job&object_id=184954238

Comment 4 by uekawa@chromium.org, Mar 22 2018

Labels: ArcConstable
Summary: Unexplained string of timeouts in CtsAccelerationTestCases (was: Widespread canary failure on 3/20 early AM)
Updating the summary to reflect the symptom better.

Owner: levarum@chromium.org
Cc: takaoka@chromium.org
Status: Fixed (was: Assigned)
Since test is passing for the while, I assume it was fully recovered.

Sign in to add a comment