Issue metadata
Sign in to add a comment
|
14.4%-18% regression in media.desktop at 536997:537090 |
||||||||||||||||||||
Issue descriptionSee the link to graphs below.
,
Feb 20 2018
📍 Pinpoint job started. https://pinpoint-dot-chromeperf.appspot.com/job/11e02307840000
,
Feb 20 2018
📍 Found a significant difference after 1 commit. https://pinpoint-dot-chromeperf.appspot.com/job/11e02307840000 Revert "[test_env.py] Warm up vpython virtualenv cache on swarming task shards." by kbr@chromium.org https://chromium.googlesource.com/chromium/src/+/7dde857919af2f59fcab10264d696301a15c64da Understanding performance regressions: http://g.co/ChromePerformanceRegressions
,
Feb 20 2018
If the change to test_env.py really boosted benchmark scores then that's pretty interesting and perhaps concerning because benchmarks should be more isolated from their environment than that.
,
Feb 20 2018
Very interesting, but I think this can be a bit unavoidable because the regressed metric here is power, which is measured for the whole machine by Battor device. Anything that affect the platform will affect the metric.
,
Feb 20 2018
+charliea@ is working on reducing noise.
,
Feb 20 2018
I'm trying to understand what the issue is a little bit better here. I think that I understand what vpython is: it's Python that goes through an additional layer of indirection (like rvm, ruby version manager) that better allows you to control the dependencies (e.g. Python version, Python packages) that your program is pulling in. So the case here seems to be that vpython start up time can vary and, because some of our benchmarks have that vpython start up time in their critical path, this start up time is introducing noise to the tests. Also, it seems that we're able to warm up the vpython cache by just booting up vpython really quickly in our test harness before starting the test. Warming up this cache reduces that start up time variability and therefore reduces noise. So the graphs above clearly show that warming up the vpython cache beforehand adversely impacted power during media.desktop. What instance of vpython is started while we're measuring the average power of media.desktop, though? It seems like Telemetry should be long-started by this point, no?
,
Feb 21 2018
+iannucci who is the vpython expert I suspect that warming up vpython so much earlier -- before the test target even had a chance to run -- eliminated the variability of warming it up when running the test target. I gather that all LUCI bots are going to use vpython by default, so that should further eliminate variability.
,
Feb 21 2018
,
Feb 21 2018
kbr@ yep, I kind of understand how that might change the results of some sort of microbenchmark in which many of the measurements take place 1s after Python startup or something, but it seems *nuts* that it'd change the overall power consumed by a long (10-20s) story by something like 20%. Based on the graphs, it obviously does, but that kind of shatters my belief that we have even close to a grasp on the factors that might affect power noise in our benchmark harness.
,
Feb 21 2018
,
Feb 21 2018
Issue 814204 has been merged into this issue.
,
Feb 21 2018
Issue 814205 has been merged into this issue.
,
Feb 21 2018
FTR: vpython startup time definitely can vary a lot:
* If the virtualenv indicated by the .vpython spec file doesn't exist:
* It fetches the wheels from CIPD (which may be cached)
* It generates a new virtualenv
* It installs the wheels into that new virtualenv
* It launches python in the virtualenv
Warming up the cache beforehand will skip out all the variable bits though, and is the recommended solution. Swarming tasks should also persist these virtualenvs between runs (using a named cache directory), which should also make the warmup phase a no-op most of the time.
,
Feb 22 2018
Sorry, the Pinpoint message is a little misleading here. The vpython CL didn't cause a perf regression -- the "difference" it's talking about is from failing to passing.
,
Feb 22 2018
,
Feb 23 2018
,
Feb 23 2018
Issue 814202 has been merged into this issue.
,
Mar 15 2018
|
|||||||||||||||||||||
►
Sign in to add a comment |
|||||||||||||||||||||
Comment 1 by 42576172...@developer.gserviceaccount.com
, Feb 20 2018