New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 593055 link

Starred by 1 user

Issue metadata

Status: Fixed
Owner:
Last visit > 30 days ago
Closed: Apr 2016
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: All
Pri: 2
Type: Bug



Sign in to add a comment

Bisect should respect repeat count.

Project Member Reported by simonhatch@chromium.org, Mar 8 2016

Issue description

https://build.chromium.org/p/tryserver.chromium.perf/builders/android_nexus5_perf_bisect/builds/3473

Specified repeat count of 1, running test 5 times. It's a consistent failure, so no need to run so often.
 
Cc: robert...@chromium.org dtu@chromium.org
In recipe_modules/auto_bisect/revision_state.py there is now a "minimum repeat count", which has been there since https://codereview.chromium.org/1610203003 ("Iteratively increase sample size for good/bad classification"), which was meant to make testing repeat until there's "enough evidence" to classify a revision as good or bad.

But return code bisects are a special and interesting case, because if the test doesn't fail flakily, then "enough evidence" is on only one run. Maybe sample size should *not* be iteratively increased in the case of return-code bisects?

Roberto, Dave, WDYT?
Labels: triaged
Status: Available (was: Untriaged)
Owner: robert...@chromium.org
Status: Assigned (was: Available)

Comment 4 by dtu@chromium.org, Mar 17 2016

Cc: pras...@chromium.org
 Issue 592691  has been merged into this issue.

Comment 5 by dtu@chromium.org, Mar 17 2016

There's a question of whether or not the increased code complexity of a special case is worth it. The benefit would be reduced latency/runtime, but we've had problems in the past with the differences between the code paths of return code and perf bisect.

I'd also like bisect to work with flaky test failures, in which case you'd still need the statistical classification.

Comment 6 by dtu@chromium.org, Mar 17 2016

Hm, I could see us wanting it for generalized bisect, so maybe it is worth the complexity of a special case. But maybe not at this point in time?

By "generalized bisect," I mean locally bisecting on arbitrary functional regressions, like  issue 592189  or  issue 593614 .
Status: Started (was: Assigned)
I am going to make a patch to do the exact repeat count for 'return_code' bisects.
Status: Fixed (was: Started)
Components: Speed>Bisection

Sign in to add a comment