Inconsistent data between findit and flakiness dashboard

Issue description
It looks like these two tools try to solve the same problem, but one or both have incomplete data (see the discussion in https://chromium-review.googlesource.com/c/chromium/src/+/1329161). It would be nice to have a single, unified tool to check for flakes, or at least to have both tools agree. In the case here, findit reports the tests as very flaky, while the flake dash reports that they rarely flake.
Flake dash: https://test-results.appspot.com/dashboards/flakiness_dashboard.html#testType=chrome_public_test_apk&tests=testModalNativeAppBannerCanBeTriggeredMultipleTimesCustomTab
Findit: https://findit-for-me.appspot.com/flake/occurrences?key=ag9zfmZpbmRpdC1mb3ItbWVykgELEgVGbGFrZSKGAWNocm9taXVtQGNocm9tZV9wdWJsaWNfdGVzdF9hcGtAb3JnLmNocm9taXVtLmNocm9tZS5icm93c2VyLmJhbm5lcnMuQXBwQmFubmVyTWFuYWdlclRlc3QjdGVzdEFwcEluc3RhbGxlZE1vZGFsTmF0aXZlQXBwQmFubmVyQ3VzdG9tVGFiDA
Nov 14
>> I'm not familiar with the flakiness dashboard. I'd love to learn how it determines flakes.
I don't think the flakiness dashboard 'determines' flakes; it simply presents the test results. That said, I'm not sure whether the flakiness dashboard knows how to interpret the test results of retry-with-patch steps.
Nov 14
re #2: Correct. "Flakiness Dashboard" is a really misleading name that everyone uses for the test-results.appspot.com UI. It doesn't know anything about flakes and doesn't try to analyze results over time, so "Flakiness" is only in the name. It is also not very smart about retries and patch steps.
Comment 1 by chanli@chromium.org, Nov 14