
Issue 744744

Issue metadata

Status: WontFix
Owner: ----
Closed: Aug 2017
Cc:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 2
Type: Bug-Regression

Blocked on:
issue 664765



62.7% regression in system_health.common_desktop at 486846:486989

Project Member Reported by nzolghadr@chromium.org, Jul 17 2017

Issue description

See the link to graphs below.
 
Project Member

Comment 1 by 42576172...@developer.gserviceaccount.com, Jul 17 2017

All graphs for this bug:
  https://chromeperf.appspot.com/group_report?bug_id=744744

(For debugging:) Original alerts at time of bug-filing:
  https://chromeperf.appspot.com/group_report?sid=f7fa98856661eb9801f283b36626ec8391c0a360ac5ecaaea43ca5ebb6873dd4


Bot(s) for this bug's original alert(s):

chromium-rel-mac-retina
Project Member

Comment 3 by 42576172...@developer.gserviceaccount.com, Jul 17 2017


=== BISECT JOB RESULTS ===
Bisect failed for unknown reasons

Please contact the team (see below) and report the error.


Bisect Details
  Configuration: mac_retina_perf_bisect
  Benchmark    : system_health.common_desktop
  Metric       : after_load:power_avg/multitab_misc/multitab_misc_typical24


To Run This Test
  src/tools/perf/run_benchmark -v --browser=release --output-format=chartjson --upload-results --pageset-repeat=1 --also-run-disabled-tests --story-filter=multitab.misc.typical24 system_health.common_desktop

More information on addressing performance regressions:
  http://g.co/ChromePerformanceRegressions

Debug information about this bisect:
  https://chromeperf.appspot.com/buildbucket_job_status/8973798200156248080


For feedback, file a bug with component Speed>Bisection
Cc: charliea@chromium.org
Status: Available (was: Untriaged)
The ref build changed slightly, but after that the signal becomes noisier. Adding Charlie to see whether this is something we can look into.
Project Member

Comment 6 by 42576172...@developer.gserviceaccount.com, Aug 11 2017


=== BISECT JOB RESULTS ===
Bisect was unable to run to completion

Error: INFRA_FAILURE

The bisect was able to narrow the range, you can try running with:
  good_revision: 8c1ea7c7b7fc0559e586b8e8744a18855758c3a1
  bad_revision : 2ca6ae0f8886b307b24f8ff5a269a5921d0c093f

If failures persist contact the team (see below) and report the error.


Bisect Details
  Configuration: mac_retina_perf_bisect
  Benchmark    : system_health.common_desktop
  Metric       : after_load:power_avg/multitab_misc/multitab_misc_typical24

Revision             Result                   N      Status
chromium@486750      6.08068 +- 0.233281      5      good
chromium@486870      6.06634 +- 0.260769      6      good
chromium@486930      6.39238 +- 0.337659      6      good
chromium@486989      10.0179 +- 0.249568      6      bad

To Run This Test
  src/tools/perf/run_benchmark -v --browser=release --output-format=chartjson --upload-results --pageset-repeat=1 --also-run-disabled-tests --story-filter=multitab.misc.typical24 system_health.common_desktop

More information on addressing performance regressions:
  http://g.co/ChromePerformanceRegressions

Debug information about this bisect:
  https://chromeperf.appspot.com/buildbucket_job_status/8971623694851923952


For feedback, file a bug with component Speed>Bisection
Cc: simonhatch@chromium.org
I'm not sure why that last bisect pooped out right in the middle. Going to try to relaunch it with the narrowed range.

simonhatch@, do you have any idea what happened here?
Blockedon: 664765
I believe there's a 1-hour step limit, and for whatever reason one of the steps took longer than an hour (normal perf test runs took about 15 minutes prior to that). It's not clear what causes it to freeze like this, so I'll block this on crbug.com/664765.
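
To make the step limit concrete, here is a minimal sketch of how a runner can enforce a per-step timeout. It is not the bisect recipe's actual code: `run_step` and `STEP_TIMEOUT_SECONDS` are hypothetical names, and the only thing taken from this thread is the 1-hour figure.

```python
import subprocess
import sys

# Assumption: the 1 hour limit mentioned above, expressed in seconds.
STEP_TIMEOUT_SECONDS = 60 * 60


def run_step(cmd, timeout_seconds=STEP_TIMEOUT_SECONDS):
    """Run one step and classify it as 'ok', 'failed', or 'timeout'.

    Hypothetical helper for illustration only.
    """
    try:
        # subprocess.run kills the child and raises TimeoutExpired
        # if the command does not finish within timeout_seconds.
        completed = subprocess.run(cmd, timeout=timeout_seconds)
    except subprocess.TimeoutExpired:
        return "timeout"
    return "ok" if completed.returncode == 0 else "failed"


if __name__ == "__main__":
    # A step that sleeps past a short limit is reported as a timeout.
    slow_step = [sys.executable, "-c", "import time; time.sleep(5)"]
    print(run_step(slow_step, timeout_seconds=0.5))
```

With a limit like this, a test run that normally takes about 15 minutes but hangs would be cut off at the hour mark, which would be consistent with the INFRA_FAILURE above.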
Project Member

Comment 10 by 42576172...@developer.gserviceaccount.com, Aug 15 2017


=== BISECT JOB RESULTS ===
Bisect was unable to run to completion

Error: INFRA_FAILURE

The bisect was able to narrow the range, you can try running with:
  good_revision: 4e145f3c41d31bd16f10b5a2bd2dfc0c38a3d87c
  bad_revision : 1b46145c5ed1e7266c1beaa3338c19ff547933be

If failures persist contact the team (see below) and report the error.


Bisect Details
  Configuration: mac_retina_perf_bisect
  Benchmark    : system_health.common_desktop
  Metric       : after_load:power_avg/multitab_misc/multitab_misc_typical24

Revision             Result                   N      Status
chromium@486930      6.56914 +- 1.14463       6      good
chromium@486945      6.33133 +- 0.246864      6      good
chromium@486953      6.46046 +- 0.446997      5      good
chromium@486960      10.1174 +- 0.566069      6      bad
chromium@486989      10.0315 +- 0.379159      5      bad

To Run This Test
  src/tools/perf/run_benchmark -v --browser=release --output-format=chartjson --upload-results --pageset-repeat=1 --also-run-disabled-tests --story-filter=multitab.misc.typical24 system_health.common_desktop

More information on addressing performance regressions:
  http://g.co/ChromePerformanceRegressions

Debug information about this bisect:
  https://chromeperf.appspot.com/buildbucket_job_status/8971280235385680448


For feedback, file a bug with component Speed>Bisection
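
Although neither bisect finished, the two result tables above already bound the regression. The sketch below parses the rows as they appear in this thread, then reports the last good revision, the first bad revision, and the relative change in the metric between them. The helper names are made up for illustration, and it assumes every good revision precedes every bad one, which holds for these rows.

```python
import re

# Rows copied verbatim from the two bisect result tables in this thread.
BISECT_ROWS = """
chromium@486750      6.08068 +- 0.233281      5      good
chromium@486870      6.06634 +- 0.260769      6      good
chromium@486930      6.39238 +- 0.337659      6      good
chromium@486989      10.0179 +- 0.249568      6      bad
chromium@486930      6.56914 +- 1.14463       6      good
chromium@486945      6.33133 +- 0.246864      6      good
chromium@486953      6.46046 +- 0.446997      5      good
chromium@486960      10.1174 +- 0.566069      6      bad
chromium@486989      10.0315 +- 0.379159      5      bad
"""

ROW_RE = re.compile(
    r"chromium@(\d+)\s+([\d.]+) \+- ([\d.]+)\s+(\d+)\s+(good|bad)"
)


def parse_rows(text):
    """Return (revision, mean, status) tuples for each table row."""
    return [
        (int(m.group(1)), float(m.group(2)), m.group(5))
        for m in ROW_RE.finditer(text)
    ]


def narrowed_range(rows):
    """Return the highest good row and the lowest bad row."""
    last_good = max((r for r in rows if r[2] == "good"), key=lambda r: r[0])
    first_bad = min((r for r in rows if r[2] == "bad"), key=lambda r: r[0])
    return last_good, first_bad


def percent_change(before, after):
    """Relative change from before to after, in percent."""
    return (after - before) / before * 100.0


if __name__ == "__main__":
    good, bad = narrowed_range(parse_rows(BISECT_ROWS))
    print(f"last good: chromium@{good[0]} (mean {good[1]})")
    print(f"first bad: chromium@{bad[0]} (mean {bad[1]})")
    print(f"change: {percent_change(good[1], bad[1]):.1f}%")
```

The percentage printed here comes from the bisect runs only, so it need not match the figure in the issue title, which comes from the original alert.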
Third time's the charm...
One problem seems to be bug 754746, where the BattOr agent is crashing. I'm not sure why it's taking so long to do so, though.
Project Member

Comment 14 by 42576172...@developer.gserviceaccount.com, Aug 15 2017


=== BISECT JOB RESULTS ===
Bisect failed for unknown reasons

Please contact the team (see below) and report the error.


Bisect Details
  Configuration: mac_retina_perf_bisect
  Benchmark    : system_health.common_desktop
  Metric       : after_load:power_avg/multitab_misc/multitab_misc_typical24


To Run This Test
  src/tools/perf/run_benchmark -v --browser=release --output-format=chartjson --upload-results --pageset-repeat=1 --also-run-disabled-tests --story-filter=multitab.misc.typical24 system_health.common_desktop

More information on addressing performance regressions:
  http://g.co/ChromePerformanceRegressions

Debug information about this bisect:
  https://chromeperf.appspot.com/buildbucket_job_status/8971170182904069744


For feedback, file a bug with component Speed>Bisection
(╯°□°)╯︵ ┻━┻

This is actually failing due to a bug that I already fixed (after the last failed bisect), so _hopefully_ if I kick off the bisect again, it'll pick up the new version of Telemetry and I won't have to see this frickin' failure again.
Cc: dtu@chromium.org
Bisects run with the version of Telemetry pinned to the revision they're running at, so I think the bisect will keep failing.

+dtu, is this easier to handle with Pinpoint?
Ah, shucks. Never mind then :-( It's likely this is just going to keep failing. For some reason, I thought the Catapult and benchmark isolate was pinned to HEAD while the Chrome version varied. Looks like I was wrong.
Project Member

Comment 19 by 42576172...@developer.gserviceaccount.com, Aug 25 2017


=== BISECT JOB RESULTS ===
Bisect failed for unknown reasons

Please contact the team (see below) and report the error.


Bisect Details
  Configuration: mac_retina_perf_bisect
  Benchmark    : system_health.common_desktop
  Metric       : after_load:power_avg/multitab_misc/multitab_misc_typical24


To Run This Test
  src/tools/perf/run_benchmark -v --browser=release --output-format=chartjson --upload-results --pageset-repeat=1 --also-run-disabled-tests --story-filter=multitab.misc.typical24 system_health.common_desktop

More information on addressing performance regressions:
  http://g.co/ChromePerformanceRegressions

Debug information about this bisect:
  https://chromeperf.appspot.com/buildbucket_job_status/8970280912470798064


For feedback, file a bug with component Speed>Bisection
Status: WontFix (was: Available)
Looks like we won't be able to get to the bottom of this, then. Charlie, as benchmark owner, please reopen if you think it's worth investigating manually.

Comment 21 by dtu@chromium.org, Sep 1 2017

Yes! By default Pinpoint will use the catapult specified by chromium/src DEPS, and you can override it to pin it to a particular catapult revision.

We've also thought about things like running the job with a particular patch or commit cherry-picked over the failing range, but that's still in the future.