New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 600500 link

Starred by 1 user

Issue metadata

Status: Archived
Owner:
Last visit > 30 days ago
Closed: Aug 2016
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: Chrome
Pri: 1
Type: Bug



Sign in to add a comment

Frequent suite job timeouts during build

Project Member Reported by kathrelk...@chromium.org, Apr 4 2016

Issue description

E.g. https://uberchromegw.corp.google.com/i/chromeos/builders/jecht-release-group/builds/1356

There have been a lot of similar bvt_inline timeouts recently.  After the last test, the suite times out - even though the test seems to have passed.

Example on guado: https://uberchromegw.corp.google.com/i/chromeos/builders/jecht-release-group/builds/1356/steps/HWTest%20%5Bguado%5D%20%5Bbvt-inline%5D/logs/stdio

login_GuestAndActualSession: http://cautotest.corp.google.com/afe/#tab_id=view_job&object_id=58960906
The suite job has another 0:59:14.691822 till timeout.
The suite job has another 0:29:04.965304 till timeout.
The suite job has another -1 day, 23:58:58.914383 till timeout.
Suite timed out. Started on 04-04-2016 [07:37:09], timed out on 04-04-2016 [09:41:04]
Suite job                               [ FAILED ]
Suite job                                 ABORT:


No reference platform in particular.  For example, on M51-8150.0.0 alone, this same failure showed up on 23 boards: gandof, panther, monroe, stout, guado, rikku, sentry, big, pi, mario, mickey, banjo, sumo, glimmer, quawks, heli, squawks, winky, parrot, falco_li, celes, ultima, and cyan.  Reks and terra had the same failure with a different ending test.


Also happens for paygen tests.  
Example on a M50 buddy build.  Tests pass but suite is aborted:
https://uberchromegw.corp.google.com/i/chromeos_release/builders/auron-b-release-group%20release-R50-7978.B/builds/38
 
Cc: jrbarnette@chromium.org
Labels: OS-Chrome
+jrbarnette
Cc: abhishekbh@chromium.org dgarr...@chromium.org vapier@chromium.org
 Issue 600506  has been merged into this issue.
CQ also has some suite timeout issue: 

https://bugs.chromium.org/p/chromium/issues/detail?id=600022

Not sure whether it's related.
Cc: aaboagye@chromium.org
The canaries are still running into this issue. There do seem to be underlying issues that others are looking into (provision failures, SSH reporting a generic error.)

But, that should be different from the suite timing out, right?
canaries has larger probabilities to be suite timeout, since there exist no enough shards for it, which leads to slow processing.

Infra team is adding more shards.
Re#5, is there a bug tracking that effort?
no actually, it's in our OKR in this quarter.

Comment 8 by autumn@chromium.org, Apr 11 2016

Owner: dshi@chromium.org
related to canary timeouts? 

Comment 9 by dshi@chromium.org, Apr 11 2016

Re #8, yes, the bvt-inline suites are related to canary hwtest.
Components: Infra>Client>ChromeOS
Labels: -Infra-ChromeOS
Labels: bvttriage
This is still happening: bvt-inline failures cause build timeout.
Could problems today be related to the number of release builds running as tryjobs?
Status: Assigned (was: Untriaged)
Dan, is this still an issue?  If so, please assign and prioritize.

Comment 15 by dshi@chromium.org, Aug 4 2016

Status: Fixed (was: Assigned)
haven't seen for a while. Since the bug filed we have added many shards, now total at 43. The performance should be much better now. Close the bug for now.
Labels: VerifyIn-54
Status: Assigned (was: Fixed)
Seeing this issue again in last two builds of veyron_mighty-paladin HWTest [bvt-inline]:

https://uberchromegw.corp.google.com/i/chromeos/builders/veyron_mighty-paladin/builds/2988/steps/HWTest%20%5Bbvt-inline%5D/logs/stdio
https://uberchromegw.corp.google.com/i/chromeos/builders/veyron_mighty-paladin/builds/2989/steps/HWTest%20%5Bbvt-inline%5D/logs/stdio


  The suite job has another 0:29:44.595083 till timeout.
  The suite job has another -1 day, 23:59:42.196116 till timeout.
  Suite timed out. Started on 08-29-2016 [21:46:44], timed out on 08-29-2016 [23:26:04]
  Suite job                               [ FAILED ]
  Suite job                                 ABORT: 

Comment 18 by dshi@chromium.org, Aug 30 2016

Status: Fixed (was: Assigned)
Please open new bugs for this failure. From the job:
http://cautotest.corp.google.com/afe/#tab_id=view_job&object_id=75012916
FAIL	security_NetworkListeners	security_NetworkListeners	timestamp=1472521827	localtime=Aug 29 18:50:27	Android did not boot!

The test failed already, the timeout is caused by crash dump collection/symbolication.


Labels: VerifyIn-55

Comment 20 by dchan@chromium.org, Oct 10 2016

Labels: -VerifyIn-55

Comment 21 by dchan@google.com, Nov 19 2016

Labels: VerifyIn-56

Comment 22 by dchan@google.com, Jan 21 2017

Labels: VerifyIn-57

Comment 23 by dchan@google.com, Mar 4 2017

Labels: VerifyIn-58

Comment 24 by dchan@google.com, Apr 17 2017

Labels: VerifyIn-59

Comment 25 by dchan@google.com, May 30 2017

Labels: VerifyIn-60
Labels: VerifyIn-61

Comment 27 by dchan@chromium.org, Oct 14 2017

Status: Archived (was: Fixed)

Sign in to add a comment