
Issue 900766

Starred by 1 user

Issue metadata

Status: WontFix
Owner:
Closed: Nov 5
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: Chrome
Pri: 2
Type: Bug




[Autotests not running] nocturne not running various OOB test suites as expected

Project Member Reported by harpreet@chromium.org, Oct 31

Issue description

chromeos-infra, need your help in checking why the following suites are not running on nocturne as scheduled...

bluetooth_sanity - should run nightly on tot
https://cs.corp.google.com/chromeos_public/infra/suite_scheduler/configs/suite_scheduler.ini?l=176


wifi_endtoend - should run nightly on tot
https://cs.corp.google.com/chromeos_public/infra/suite_scheduler/configs/suite_scheduler.ini?l=616


wifi_matfunc - should run nightly on tot
https://cs.corp.google.com/chromeos_public/infra/suite_scheduler/configs/suite_scheduler.ini?l=594


wifi_perf - should run nightly on tot
https://cs.corp.google.com/chromeos_public/infra/suite_scheduler/configs/suite_scheduler.ini?l=627


From the stainless link below, it looks like the wifi_* suites are getting scheduled but are not run on most days. For bluetooth_sanity, it looks like it's not even getting scheduled on a nightly basis.

https://stainless.corp.google.com/search?view=matrix&row=test&col=build&first_date=2018-10-04&last_date=2018-10-31&suite=bluetooth_sanity%7Cwifi_matfunc%7Cwifi_endtoend%7Cwifi_perf&model=%5Enocturne%24&exclude_cts=true&exclude_not_run=false&exclude_non_release=true&exclude_au=true&exclude_acts=true&exclude_retried=true&exclude_non_production=false



Can someone on chromeos-infra please take a look or point me to where I can look to see what the issue might be?
 
Owner: zamorzaev@chromium.org
Status: Assigned (was: Untriaged)
-> deputy

The most likely cause of this is a lack of working DUTs.
Labels: Hotlist-Deputy
Status: Started (was: Assigned)
I don't think it's simply a lack of DUTs: there are 10 nocturne DUTs in the "Ready" state in pool:suites right now.

I'm looking into it.
Cc: shijinabraham@chromium.org
I didn't pay attention earlier: these suites use pool:wificell, which currently has a single DUT, chromeos15-row4-rack11-host3, and that DUT has been failing repair for a couple of days.
Maybe this is merely a question of corrupted pool labels? (why should any pool have just one DUT?)
This is not an auto-managed pool and judging by the name "wificell" the duts require a very special placement/treatment.
Correct, there is only 1 nocturne DUT in the wificell pool, and the device has been down since yesterday. I have asked the lab folks to power cycle it.

The issue reported here has been happening for a while, though; it is not related to the recent repair-failed state.
Is a single dut enough for the throughput of that pool?
Labels: -Pri-1 Pri-2
Looks like lots of jobs ran in the last 2-3 days: https://stainless.corp.google.com/search?view=matrix&row=test&col=queued_date&first_date=2018-10-23&last_date=2018-11-05&suite=bluetooth_sanity%7Cwifi_matfunc%7Cwifi_endtoend%7Cwifi_perf&model=%5Enocturne%24&exclude_cts=true&exclude_not_run=false&exclude_non_release=true&exclude_au=true&exclude_acts=true&exclude_retried=true&exclude_non_production=false


Also, looking at the shard jobs for some of the jobs that didn't run earlier, I think it is indeed down to just that one DUT being repeatedly in the repair_failed state. The shard can't find a DUT to match, so the job stays queued and times out.
http://chromeos-skunk-3.mtv.corp.google.com/afe/#tab_id=view_job&object_id=255479121
http://chromeos-skunk-3.mtv.corp.google.com/afe/#tab_id=view_job&object_id=255461689
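
To illustrate the queue-then-timeout behavior described above, here is a toy model in Python. It is not the autotest scheduler: the `Dut` class, `find_ready_dut`, `schedule`, the status strings, and the `hypothetical-host*` names are all made up for the sketch. Only the counts (one failing nocturne DUT in pool:wificell, ten Ready in pool:suites) and the one real hostname come from this thread.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Dut:
    hostname: str
    board: str
    pool: str
    status: str  # illustrative values: "Ready", "Repair Failed"


def find_ready_dut(duts: List[Dut], board: str, pool: str) -> Optional[Dut]:
    """Return the first Ready DUT matching both board and pool, else None."""
    for dut in duts:
        if dut.board == board and dut.pool == pool and dut.status == "Ready":
            return dut
    return None


def schedule(duts: List[Dut], board: str, pool: str, max_polls: int) -> str:
    """Poll for a matching DUT; give up after max_polls attempts.

    In a real lab the DUT state could change between polls; here the
    inventory is static, so the outcome is decided on the first poll.
    """
    for _ in range(max_polls):
        if find_ready_dut(duts, board, pool) is not None:
            return "running"
    return "timed out"


# Inventory mirroring the situation in this thread (hostnames other than
# the wificell one are hypothetical).
inventory = [Dut("chromeos15-row4-rack11-host3", "nocturne", "wificell", "Repair Failed")]
inventory += [
    Dut(f"hypothetical-host{i}", "nocturne", "suites", "Ready") for i in range(10)
]

print(schedule(inventory, "nocturne", "wificell", max_polls=3))
print(schedule(inventory, "nocturne", "suites", max_polls=3))
```

The point of the sketch is that Ready DUTs in another pool do not help: a job pinned to pool:wificell only matches DUTs carrying that pool label.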

The bluetooth_nightly runs were still not scheduled until today, and I can't explain that.
Status: WontFix (was: Started)
Ah, bluetooth_sanity is actually scheduled weekly: http://shortn/_SzqlkokTgP

That's because the nightly suite's whitelist doesn't have nocturne: https://cs.corp.google.com/chromeos_public/infra/suite_scheduler/configs/suite_scheduler.ini?l=184

So, WAI.
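
For anyone who wants to check this kind of thing without reading the file by eye, here is a rough Python sketch of a whitelist lookup. The section names, the `boards` key, the sample board names, and the rule that a missing `boards` key means "no restriction" are all assumptions made for illustration; the real suite_scheduler.ini may use different names and semantics.

```python
import configparser

# Hypothetical fragment in the style of an INI scheduler config.
SAMPLE_INI = """
[HypotheticalBluetoothNightly]
run_on: nightly
suite: bluetooth_sanity
boards: boardA, boardB

[HypotheticalBluetoothWeekly]
run_on: weekly
suite: bluetooth_sanity
"""


def is_board_scheduled(parser: configparser.ConfigParser, section: str, board: str) -> bool:
    """Return True if `board` passes the section's whitelist.

    Assumption: a section without a `boards` key applies to every board.
    """
    if not parser.has_option(section, "boards"):
        return True
    allowed = [b.strip() for b in parser.get(section, "boards").split(",")]
    return board in allowed


parser = configparser.ConfigParser()
parser.read_string(SAMPLE_INI)

for section in parser.sections():
    print(section, is_board_scheduled(parser, section, "nocturne"))
```

With this sample, nocturne is excluded from the nightly section (it is not in the whitelist) but still picked up by the weekly one, which matches the behavior seen here.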
pprabhu@ thanks for taking a look. You are right about nocturne missing from the nightly suite whitelist. I just found that out today as well.


Any input on "Is there a way to find out how long each test takes to run on a given device?"
Project Member

Comment 13 by bugdroid1@chromium.org, Nov 7

The following revision refers to this bug:
  https://chromium.googlesource.com/chromiumos/infra/suite_scheduler/+/c546e402a27f5d1665706fb48b3ae38cf332d52c

commit c546e402a27f5d1665706fb48b3ae38cf332d52c
Author: Shijin Abraham <shijinabraham@google.com>
Date: Wed Nov 07 14:34:20 2018

Add nocturne to bluetooth_sanity
Remove no_delay from Wifi_Release

BUG= chromium:900766 
TEST=None

Change-Id: If5f7d5918f243c4aaabc1277824986c4c79e97f8
Reviewed-on: https://chromium-review.googlesource.com/1318098
Commit-Ready: ChromeOS CL Exonerator Bot <chromiumos-cl-exonerator@appspot.gserviceaccount.com>
Tested-by: Shijin Abraham <shijinabraham@google.com>
Reviewed-by: Harpreet Grewal <harpreet@chromium.org>

[modify] https://crrev.com/c546e402a27f5d1665706fb48b3ae38cf332d52c/configs/suite_scheduler.ini
