Because checking for sufficient sample size to declare that two revisions are not different has been delegated to compare_samples in catapult, the refactored bisect is now breaking when re-testing a revision that produces no comparable values. (As it defaults to NEED_MORE_DATA)
E.g.
https://build.chromium.org/p/tryserver.chromium.perf/builders/staging_mac_10_10_perf_bisect/builds/20
Bisect needs to either:
A) Set an upper limit to the number of times a revision is re-tested regardless of the number of observations yielded by the test, or
B) Give up if either sample is empty after a fixed number of tests.
Comment 1 by dtu@chromium.org
, Oct 28 2016