Non-Continuous Perf Gap on Mobile |
||||
Issue descriptionWe're looking into the Speedometer2.0 perf data and noticed that the bots stopped producing results after 2018-07-03 [1]. However there are new bots available which were never announced! Old bots: android-nexus-5 New bots: Android Nexus5 Perf From the user-side of things things change is quite unfortunate and not the only hiccup in the recent month, and thus we have a few questions: - Why is there no announcement email that bots are going to be renamed? This is the second time in a month that we're all of a sudden realizing that our benchmarks don't produce data anymore, only to painfully realize that the bots have been renamed. - Why is there no clear indication on chromeperf marking bots as deprecated? We don't always double check the dates and this easily overlaps with bots actually stopping to work, and we have no clue about this when looking at the performance graphs. Most of us have save dashboards for the benchmarks that are relevant to our day to day work. On these graphs there is absolutely no indication of whether a bot has been deactivated. - Why are bots switched in the middle of a quarter without overlap? We're doing Q per Q performance evaluation, having no overlap renders quite a bit of data useless to us. [1] https://chromeperf.appspot.com/report?sid=d81c6966288faf7afbd63a356cd40b23aea186f9d05dc656fd0570e6af8f1e73
,
Jul 16
,
Jul 16
I think the key problem here is v8 team are using the perf dashboard the way that are not well supported. The key focus of the product has always been around enable teams to get regression alerts. The use cases of watching over the performance graphs and quater-to-quarter evaluation are organic uses, thus not currently well designed yet. I am not sure what to do about this bug. Probably better to bring this overall issue to the v8-benchmarking discussion.
,
Jul 16
Also to clarify the bug, this is only a builder renaming, there is no bots that get switched. So if we do perf dashboard migration from old name to new name (at least for data started since this quarter), there would be no gap
,
Jul 16
I think we need to have a fundamental discussion on how to use the dashboard then. Most of the V8 folks doing performance work have to look at the graphs the old-fashioned way, there is simply no possibility to get positive feedback for sub-1% improvements in most cases. Plus you always have to compare trade-offs, something that's also very hard to get from just looking at the regressions themselves. Indeed, V8 use-cases are probably not well understood here. I don't think any of these issues are new, in fact I remember having similar discussions 2 years ago.
,
Jul 16
Having a fundamental discussion sgtm. For this immediate bug though, I think we can still fix some of the bleeding by doing data migration. Simon: if you think the data migration is the right solution, can you help with triaging?
,
Jul 16
+dtu, +ushesh re: #c0 - Why is there no announcement email that bots are going to be renamed? Sorry about that but unfortunately the name change was only relayed to us after the change went live as well, and I discussed concerns with benchmarking team about the possibility of data loss if we didn't do some sort of migration. - Why is there no clear indication on chromeperf marking bots as deprecated? Believe it takes 2 weeks for the dashboard to consider something deprecated (ie. no data being uploaded anymore). - Why are bots switched in the middle of a quarter without overlap? Emily summarized our discussion in a doc here with some possible approaches to migrating the data: https://docs.google.com/document/d/1NhMuaeYKxpRy56l-2Wvh0KJ0SIJ2FV6sSDK5qv_7TNk/edit It kind of sounds like v8's use case is to look at fixed reports over time periods of a few months? Which, if that's the case, isn't unreasonable or unsupported. Dave and I are discussing adding an OKR around establishing some best-practices for these kind of breaking changes.
,
Jul 20
|
||||
►
Sign in to add a comment |
||||
Comment 1 by hablich@chromium.org
, Jul 16Components: Speed>Dashboard Speed>Benchmarks