New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 664505 link

Starred by 5 users

Issue metadata

Status: Archived
Owner:
Closed: Aug 2017
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 1
Type: Bug


Sign in to add a comment

System Health benchmark failures on different bots for different reasons

Project Member Reported by perezju@chromium.org, Nov 11 2016

Issue description

There appear to be a lot of different things causing various system health benchmarks (mobile/desktop, common/memory) to fail on different bots.

Starting with this catch-all bug trying to organize and fix things.

Some of the issues that have been seen so far:

- Benchmark timing out.
- Data not making it to perf dashboard
- Individual stories flaking
- Individual stories failing
- Individual stories crashing
- Individual stories crashing and causing the benchmark to be aborted.
- Logs are huge are very spammy, making it all very hard to diagnose.

 
Thanks for opening this bug, Juan. We should also investigate which failures are not caught by the smoke test on CQ.
Blockedon: 664132
Blockedon: 663899 663900 655688
Blockedon: 664515
Blockedon: 664518
Blockedon: 664521
Blockedon: 664541
I have a https://codereview.chromium.org/2504653002/ which cleans up some verbose logging at the end of each telemetry run by switching it from warning/critical/info to debug so that on local runs we can still enable the logging if we need more information. Its not a very thorough job of cleaning the lobs, just the low hanging fruit that probably will get the least amount of push back from removing from each telemetry run. For more logging to be removed, I think we should have a deeper discussion. I don't mind starting that discussion if no one else is already doing so or would be better suited for it.
Project Member

Comment 9 by bugdroid1@chromium.org, Nov 15 2016

The following revision refers to this bug:
  https://chromium.googlesource.com/chromium/src.git/+/67aaeb3a6e67033a8187de1e863b97ba92d85ed9

commit 67aaeb3a6e67033a8187de1e863b97ba92d85ed9
Author: catapult-deps-roller <catapult-deps-roller@chromium.org>
Date: Tue Nov 15 21:58:26 2016

Roll src/third_party/catapult/ 54dd0fc86..8621fc142 (4 commits).

https://chromium.googlesource.com/external/github.com/catapult-project/catapult.git/+log/54dd0fc86f01..8621fc142fbb

$ git log 54dd0fc86..8621fc142 --date=short --no-merges --format='%ad %ae %s'
2016-11-15 charliea Kill per frame power metrics
2016-11-15 rnephew [Telemetry] Decrease end of test run logging.
2016-11-15 benjhayden Make valueset2html take MreResults.
2016-11-15 benjhayden Override overview line charts dataRange.

BUG= 664505 

Documentation for the AutoRoller is here:
https://skia.googlesource.com/buildbot/+/master/autoroll/README.md

If the roll is causing failures, see:
http://www.chromium.org/developers/tree-sheriffs/sheriff-details-chromium#TOC-Failures-due-to-DEPS-rolls

CQ_INCLUDE_TRYBOTS=master.tryserver.chromium.android:android_optional_gpu_tests_rel
TBR=catapult-sheriff@chromium.org

Review-Url: https://codereview.chromium.org/2497893004
Cr-Commit-Position: refs/heads/master@{#432265}

[modify] https://crrev.com/67aaeb3a6e67033a8187de1e863b97ba92d85ed9/DEPS

Blockedon: 665465 664661
Blockedon: 666807
Blockedon: 668097
Blockedon: 668098

Comment 14 by ajha@chromium.org, Nov 24 2016

Labels: Performance-Sheriff-BotHealth
Labels: -Pri-3 Hotlist-MemoryInfra Pri-1
Owner: perezju@chromium.org
Status: Assigned (was: Untriaged)
Blockedon: 668144
Blockedon: 657433
Blockedon: 657454
Blockedon: 651198
Blockedon: 668247
Blockedon: 671018
Cc: sullivan@chromium.org benhenry@chromium.org perezju@chromium.org
 Issue 668247  has been merged into this issue.
The bulk of the remaining failures appear to be from gmail pages.

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4816
[  FAILED  ] 2 tests, listed below:
[  FAILED  ]  long_running:tools:gmail-foreground
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%282%29/builds/4511
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5X%20Perf%20%281%29/builds/3943
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  load:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5X%20Perf%20%282%29/builds/2458
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  long_running:tools:gmail-foreground

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus6%20Perf%20%281%29/builds/4519
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  background:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus6%20Perf%20%282%29/builds/4393
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  load:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus7v2%20Perf%20%281%29/builds/4332
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  background:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus7v2%20Perf%20%282%29/builds/3445
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  background:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus9%20Perf%20%281%29/builds/3798
[  FAILED  ] 2 tests, listed below:
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  load:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20One%20Perf%20%281%29/builds/4565
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  background:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20One%20Perf%20%282%29/builds/5105
[  FAILED  ] 1 test, listed below:
[  FAILED  ]  background:tools:gmail

This is not an exhaustive list of all the failures in system health mobile.
I did a deeper dive into just the Nexus 5 failures and they look like this over the past 30 runs (skipping runs that failed due to infra issues):

TL;DR:
[  FAILED  ]  background:tools:gmail                - 10
[  FAILED  ]  blank:about:blank                     -  1
[  FAILED  ]  browse:news:nytimes                   -  6
[  FAILED  ]  browse:news:qq                        -  3
[  FAILED  ]  load:tools:gmail                      -  9
[  FAILED  ]  long_running:tools:gmail-background   -  8
[  FAILED  ]  long_running:tools:gmail-foreground   -  8

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4817
[  FAILED  ]  browse:news:nytimes

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4816
[  FAILED  ]  long_running:tools:gmail-foreground
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4815
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4814
[  FAILED  ]  browse:news:nytimes

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4813
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  long_running:tools:gmail-foreground
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  load:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4812
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4811
[  FAILED  ]  load:tools:gmail
[  FAILED  ]  browse:news:nytimes

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4810
[  FAILED  ]  long_running:tools:gmail-background
[  FAILED  ]  browse:news:qq

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4809
[  FAILED  ]  load:tools:gmail
[  FAILED  ]  browse:news:qq

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4808
[  FAILED  ]  long_running:tools:gmail-foreground

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4807
[  FAILED  ]  long_running:tools:gmail-foreground
[  FAILED  ]  browse:news:qq

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4806
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  load:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4805
Passed

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4804
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  long_running:tools:gmail-foreground
[  FAILED  ]  background:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4803
[  FAILED  ]  long_running:tools:gmail-foreground
[  FAILED  ]  long_running:tools:gmail-foreground
[  FAILED  ]  browse:news:nytimes

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4802
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4801
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  browse:news:nytimes

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4800
[  FAILED  ]  background:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4799
[  FAILED  ]  background:tools:gmail
[  FAILED  ]  load:tools:gmail
[  FAILED  ]  long_running:tools:gmail-foreground

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4798
pass

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4797
[  FAILED  ]  load:tools:gmail
[  FAILED  ]  blank:about:blank

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4796
[  FAILED  ]  load:tools:gmail
[  FAILED  ]  long_running:tools:gmail-background
[  FAILED  ]  browse:news:nytimes

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4794
[  FAILED  ]  background:tools:gmail

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4791
[  FAILED  ]  load:tools:gmail
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4790
[  FAILED  ]  load:tools:gmail
[  FAILED  ]  long_running:tools:gmail-background

https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5%20Perf%20%281%29/builds/4789
pass

Did some data collection of the past 20 runs on all android platforms:

https://docs.google.com/a/google.com/spreadsheets/d/1VuwbtHXD9dscFl9I3vsghuRZWjDBwmxRdt1hAWNRuFs/edit?usp=sharing
After some hacking and scripting I've got the list of all stories that have failed in a bot for at least two different builds in the past 20 builds (or whatever I could grab from logdog).

[A number indicates number of failures in a single build, a '-' means that the story succeeded all 3 page set repeats, and a '?' means that the story didn't run there.]


## browse:search:google
win-7     33333333333333333-33
win-7-x64 333333333333333333-3

Conclusion: Failing consistently, should be disabled on windows.


## load:tools:gmail
android-nexus7v2                   1---1-1---1------13
android-nexus9                     ---------1---1---111
android-one                        ------11---------11-
health-plan-clankium-low-end-phone -11---11-------1-1--
health-plan-clankium-phone         ---12---1-11------11

Conclusion: Very flaky, just got (actually) disabled on Android.


Sadly, all the others are stories flaking on just one or two bots, but not in all others. Next step will be to dig through the logs of those failures, and try to figure out if there are any common causes.


## background:news:nytimes
android-nexus5 [ref] --1--1?--1-1--11

## background:search:google
android-one [ref] ----?-11??-?--??----

## browse:media:imgur
android-nexus6 [ref] -----?-1?-11--1-----

## browse:media:youtube
android-nexus7v2 1-----------1--1---

## browse:news:flipboard
android-nexus6       ?--------?--?---1--1
android-nexus6 [ref] -----?-??-??-1-1----

## browse:news:hackernews
android-one [ref] ----?---??1?-1??----

## browse:news:nytimes
android-nexus5       -1-?1--1--1-1---
android-nexus6 [ref] -----1--1----?------

## browse:news:qq
android-nexus5 -?-?--1?1-?--1--

## browse:social:twitter
android-nexus6 ---------1--1-------

## load:media:imgur
android-nexus9 [ref] 1?--?----??----1----

## load:news:bbc
android-nexus6 [ref] -1--1?--?----?------

## load:news:nytimes
android-nexus9 [ref] -1-111---?--1-------

## load:news:sohu
android-one [ref] ----1---11-11-11----

## load:social:instagram
android-nexus9 [ref] -?--?--11?1---------


Some of the (?) also seem to come from a test failing and the whole of the benchmark bailing out. We'll also need to figure out what is going on there.
Blockedon: 673775
Hmm. I wonder if there is a way to transform the data into a chrome trace for better visualization. 
 Issue 666807  has been merged into this issue.
Blockedon: 674880
Issue 675002 has been merged into this issue.
Blockedon: 676315
Just did a new run of my script, and things are starting to look a lot better.

I see the following patterns:

# Failures on android-nexus9

Something was probably broken on this bot, but appears to have recovered now.

android-nexus9       background:news:nytimes  ------------33
android-nexus9       background:search:google ------------12
android-nexus9       search:portal:google     ------------33

# Flakiness on ref builds

Probably expected since the ref build doesn't have the latest Chrome fixes? When is the next roll of the ref build expected to happen?

android-one [ref]    browse:news:hackernews   1--1?-??---?
android-one [ref]    load:news:sohu           ---11-11---1
android-nexus6 [ref] browse:media:imgur       ---1---?-1-
android-nexus6 [ref] browse:news:flipboard    -1----1?--1
android-nexus6 [ref] load:news:bbc            --1----1---
android-nexus9 [ref] load:media:flickr        -----1-1------
android-nexus9 [ref] load:news:nytimes        1---1------1--

# Flaky browse:news:nytimes on android-nexus5

And we're down to a single story on a single bot which perhaps merits further investigation. Filed  issue 676315  to track this.

android-nexus5       browse:news:nytimes      1--11-1111-
Blockedon: 676336
Blockedon: 676338
Blockedon: 676341
Blockedon: 676342
Blockedon: 656040
Blockedon: 661775
Blockedon: 663025
Blockedon: 657665
Blockedon: 665007
Blockedon: 662971
Blockedon: 649392
Blockedon: 642463
Cc: -perezju@chromium.org eyaich@chromium.org
I'm guessing some of Emily's group's work had something to do with bots working and being green.

I'm not sure about ref build rolls, but I'm also interested.
Project Member

Comment 47 by bugdroid1@chromium.org, Dec 22 2016

Blocking: 652203
Components: Internals>Instrumentation>Memory
Hi.
I've noticed that all media stories (play:media) have no memory dumps on mac:
https://chromeperf.appspot.com/report?sid=74030f6d5104c014563cb69b347462a7d5fb6800911618ab55eceed0a1f37b4f
Blockedon: 688995
Cc: erikc...@chromium.org
+erikchen, is there a bug for #50?
I've filed issue 688995 for that one. Sorry I forgot to update here.
Blockedon: 691654
Blockedon: 696675
Blockedon: 704197
Wanted to give a quick update on the situation here.

                 test_runs failed  infra missing story_runs failed missing
2017-03-29 09:12     1,000  13.2%   0.5%    6.3%    107,770   0.2%    2.9%
2017-03-30 15:34     1,000  14.2%   0.5%    5.1%    108,455   0.1%    2.9%
2017-03-31 09:55     1,000  16.3%   0.6%    4.0%    109,185   0.2%    2.8%
2017-04-03 09:17     1,000   9.0%   0.1%    1.7%    110,566   0.1%    1.1%
2017-04-04 09:38     1,000   4.4%   0.0%    1.3%    110,536   0.0%    0.6%
2017-04-05 10:18     1,000   2.9%   0.3%    1.0%    110,598   0.0%    0.8%

(Stats for latest 20 builds on all system health config/benchmarks, for non-reference builds.)

I think that, among other things, a lot of that recent drop in failures is due to the fix for  issue 691654 . So, big shout out again to John for that one!

Mostly we're back to manageable levels. I took over the remaining "blocked on" bugs on Android, to work on fixing/re-enabling the remaining disabled stories.

Several of the remaining disabled stories are blocked on  issue 679768  (chrome:tracing fails when the trace is too large).

And the rest of the disabled are a few scattered desktop stories.
Status: Archived (was: Assigned)
This issue was created > 6 months ago. The perf waterfall has changed significantly since then. If this bug is still relevant, please re-open.
For what it's worth, I keep looking at the benchmark status (something similar to #57 above) at least once a week, filing bugs when necessary. But I haven't updated this one in particular, and I agree it's probably fine to close.

Things are (somewhat) more stable now, but more work has yet to be done. Hoping that having this sort of data from flakyness dashboard will also give more visibility to problems when they occur and help us keep the waterfall healthier.

Sign in to add a comment