New issue
Advanced search Search tips

Issue 818741 link

Starred by 1 user

Issue metadata

Status: Archived
Owner: ----
Closed: Jul 13
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 3
Type: Bug



Sign in to add a comment

bob-paladin: buildslave lost connection with buildbot master (because of log silence?)

Project Member Reported by pprabhu@chromium.org, Mar 5 2018

Issue description

master CQ run: https://uberchromegw.corp.google.com/i/chromeos/builders/master-paladin/builds/17918
Failed build: https://luci-milo.appspot.com/buildbot/chromeos/bob-paladin/2465

Build slave message (interrupt):

[Failure instance: Traceback (failure with no frames): <class 'twisted.internet.error.ConnectionLost'>: Connection to the other side was lost in a non-clean fashion.
]

Failed in HWTest: 

08:27:20: INFO: Re-run swarming_cmd to avoid buildbot salency check.
08:27:20: INFO: RunCommand: /b/c/cbuild/repository/chromite/third_party/swarming.client/swarming.py run --swarming chromeos-proxy.appspot.com --task-summary-json /tmp/cbuildbot-tmp3TJF5P/tmpIgokZr/temp_summary.json --raw-cmd --task-name bob-paladin/R67-10461.0.0-rc3-bvt-arc --dimension os Ubuntu-14.04 --dimension pool default --print-status-updates --timeout 9000 --io-timeout 9000 --hard-timeout 9000 --expiration 1200 '--tags=priority:CQ' '--tags=suite:bvt-arc' '--tags=build:bob-paladin/R67-10461.0.0-rc3' '--tags=task_name:bob-paladin/R67-10461.0.0-rc3-bvt-arc' '--tags=board:bob' -- /usr/local/autotest/site_utils/run_suite.py --build bob-paladin/R67-10461.0.0-rc3 --board bob --suite_name suite_attr_wrapper --pool cq --file_bugs False --priority CQ --timeout_mins 90 --retry True --max_retries 5 --minimum_duts 4 --offload_failures_only False --suite_args "{'attr_filter': u'(suite:bvt-arc) and (subsystem:default)'}" --job_keyvals "{'cidb_build_stage_id': 72391773L, 'cidb_build_id': 2351527, 'datastore_parent_key': ('Build', 2351527, 'BuildStage', 72391773L)}" --test_args "{'fast': 'True'}" -m 181072838


[No output below this...]
 
Cc: xixuan@chromium.org
xixuan: I couldn't find the swarming task for this call at https://chromeos-proxy.appspot.com/

Am I looking at the right instance?
Owner: xixuan@chromium.org
Status: Assigned (was: Unconfirmed)
Assigning to xixuan to identify error
Cc: pprabhu@chromium.org ayatane@chromium.org
Components: Infra
Labels: Infra-Troopers
Owner: ----
Status: Available (was: Assigned)
The suite is kicked off by swarming proxy and finished:

https://chromeos-proxy.appspot.com/task?id=3c0fce873b3abb10&refresh=10&show_raw=1
https://chromeos-proxy.appspot.com/task?id=3c0fcead7b2cd610&refresh=10&show_raw=1

I think this builder is aborted due to some builder issue, the logging in 'interrupt' is: 

[Failure instance: Traceback (failure with no frames): <class 'twisted.internet.error.ConnectionLost'>: Connection to the other side was lost in a non-clean fashion.]

+deupties & trooper.
[trooper here] chromeos-proxy.appspot.com gives me a 403.  


Labels: -Infra-Troopers
Trooper checking in again; I get the same result as #4.

It's not clear what involvement you're looking for from troopers, and without access to the logs, I don't think troopers can help. I think we'd be happy to do so once the access issues are resolved; please feel free to re-add Infra-Troopers at that point.
Components: -Infra
Components: Infra>Client>ChromeOS>CI
Components: -Infra>Client>ChromeOS
Status: Archived (was: Available)
The bob-paladin information on this build stored in BuildBot is gone now so I can't inspect what killed the job. I'm closing this since we've lost the ability to root-cause it at this point.

Sign in to add a comment