
Issue 863813

Starred by 3 users

Issue metadata

Status: Untriaged
Owner: ----
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 3
Type: Bug

Blocked on:
issue 865998




Non-Continuous Perf Gap on Mobile

Project Member Reported by cbruni@chromium.org, Jul 16

Issue description

We're looking into the Speedometer2.0 perf data and noticed that the bots stopped producing results after 2018-07-03 [1].
However there are new bots available which were never announced!

Old bots: android-nexus-5
New bots: Android Nexus5 Perf


From the user side of things, this change is quite unfortunate and not the only hiccup in the past month, so we have a few questions:

- Why is there no announcement email that bots are going to be renamed?
  This is the second time in a month that we have suddenly noticed that our
  benchmarks no longer produce data, only to painfully discover that the bots
  have been renamed.

- Why is there no clear indication on chromeperf marking bots as deprecated?
  We don't always double-check the dates, and a rename looks just like a bot
  that has actually stopped working; we have no clue which it is when looking
  at the performance graphs.
  Most of us have saved dashboards for the benchmarks that are relevant to our
  day-to-day work. On these graphs there is absolutely no indication of whether a
  bot has been deactivated.

- Why are bots switched in the middle of a quarter without overlap?
  We're doing quarter-over-quarter performance evaluation; having no overlap
  renders quite a bit of data useless to us.


[1] https://chromeperf.appspot.com/report?sid=d81c6966288faf7afbd63a356cd40b23aea186f9d05dc656fd0570e6af8f1e73
 
Cc: simonhatch@chromium.org nednguyen@chromium.org
Components: Speed>Dashboard Speed>Benchmarks
Summary: Non-Continuous Perf Gap on Mobile (was: Non-Continues Perf Gap on Mobile)
I think the key problem here is that the V8 team is using the perf dashboard in ways that are not well supported.

The key focus of the product has always been enabling teams to get regression alerts. Watching the performance graphs and doing quarter-to-quarter evaluation are organic uses that the dashboard is not currently designed for.

I am not sure what to do about this bug. It is probably better to bring this overall issue to the v8-benchmarking discussion.
Also, to clarify the bug: this is only a builder renaming; no bots are being switched. So if we do a perf dashboard migration from the old name to the new name (at least for data since the start of this quarter), there would be no gap.
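To make the migration idea concrete, here is a minimal sketch of remapping series from the old builder name to the new one and stitching the points together. It is illustrative only and not the dashboard's actual code: the "master/bot/benchmark/..." path layout, the `ChromiumPerf` prefix in the usage note, and the names `RENAMED_BUILDERS`, `migrate_test_path`, and `merge_series` are all assumptions. Only the two builder names come from the report above.

```python
# Hypothetical sketch: the "master/bot/benchmark/..." path layout and every
# name below are assumptions for illustration, not the dashboard's real schema.
RENAMED_BUILDERS = {
    "android-nexus-5": "Android Nexus5 Perf",
}


def migrate_test_path(test_path, renames=RENAMED_BUILDERS):
    """Return test_path with an old builder name swapped for its new name."""
    parts = test_path.split("/")
    # Assumed layout: parts[0] is the master, parts[1] is the builder.
    if len(parts) >= 2 and parts[1] in renames:
        parts[1] = renames[parts[1]]
    return "/".join(parts)


def merge_series(old_points, new_points):
    """Combine (revision, value) points from both names into one ordered series."""
    merged = dict(old_points)
    merged.update(new_points)  # On a duplicate revision, the new builder's point wins.
    return sorted(merged.items())
```

Under these assumptions, `migrate_test_path("ChromiumPerf/android-nexus-5/speedometer2/Total")` would yield the same path under `Android Nexus5 Perf`, and the merged series would show no gap at the rename.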
I think we need to have a fundamental discussion on how to use the dashboard then.


Most of the V8 folks doing performance work have to look at the graphs the old-fashioned way; in most cases there is simply no other way to get positive feedback for sub-1% improvements.
Plus, you always have to compare trade-offs, which is also very hard to do from the regressions alone.


Indeed, V8 use cases are probably not well understood here.
I don't think any of these issues are new; in fact, I remember having similar discussions 2 years ago.
Having a fundamental discussion sgtm. For this immediate bug though, I think we can still stop some of the bleeding by doing a data migration.

Simon: if you think the data migration is the right solution, can you help with triaging?
Cc: dtu@chromium.org ushesh@chromium.org
+dtu, +ushesh

re: #c0

- Why is there no announcement email that bots are going to be renamed?

Sorry about that. Unfortunately, the name change was only relayed to us after it went live as well; I discussed concerns with the benchmarking team about the possibility of data loss if we didn't do some sort of migration.


- Why is there no clear indication on chromeperf marking bots as deprecated?

I believe it takes 2 weeks for the dashboard to consider something deprecated (i.e. no data being uploaded anymore).
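As a rough illustration of that rule, a staleness check could look like the sketch below. The two-week window is taken from the sentence above; the function name `is_stale` and the exact comparison are assumptions, and the dashboard's real logic may differ.

```python
from datetime import datetime, timedelta

# Assumption: a 14-day window, per the comment above. The names and the
# strict ">" comparison are illustrative, not the dashboard's actual logic.
DEPRECATION_WINDOW = timedelta(days=14)


def is_stale(last_upload, now, window=DEPRECATION_WINDOW):
    """Return True if no data has been uploaded within the window."""
    return now - last_upload > window
```

With the dates from this report (last data on 2018-07-03, and assuming the report date is 2018-07-16), only 13 days had elapsed, so a check like this would not yet have flagged the old bot.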


- Why are bots switched in the middle of a quarter without overlap?

Emily summarized our discussion in a doc here with some possible approaches to migrating the data: https://docs.google.com/document/d/1NhMuaeYKxpRy56l-2Wvh0KJ0SIJ2FV6sSDK5qv_7TNk/edit



It kind of sounds like v8's use case is to look at fixed reports over time periods of a few months? Which, if that's the case, isn't unreasonable or unsupported.

Dave and I are discussing adding an OKR around establishing some best practices for these kinds of breaking changes.
Blockedon: 865998
