Issue 670326


Issue metadata

Status: Untriaged
Owner: ----
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 3
Type: Feature

Blocked on:
issue 704839



chromium-try-flakes: Differentiate between different types of failures with the same step name

Project Member Reported by qyears...@chromium.org, Dec 1 2016

Issue description

As far as I know right now, chromium-try-flakes mainly divides flakes into groups based on the step name that failed.

I was wondering if it would be worth it to allow division based on other things about the failure besides step name, e.g. whether the failure was a timeout or not, or what platform the failure was on.

For example, bug 637478 is the bug for all flaky failures on try bots for the step name "webkit_tests (with patch)". The discussion in that bug is limited to timeout failures caused by hanging on Windows, although there are other flakes on other platforms for other reasons.

chromium-try-flakes page: https://chromium-try-flakes.appspot.com/all_flake_occurrences?key=ahVzfmNocm9taXVtLXRyeS1mbGFrZXNyJAsSBUZsYWtlIhl3ZWJraXRfdGVzdHMgKHdpdGggcGF0Y2gpDA

It might be useful to be able to separate out flakes on different platforms and with different failure modes.
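
To make the idea concrete, here is a minimal sketch of grouping by more than the step name. It is not how chromium-try-flakes works today; the `FlakeOccurrence` record, its fields, and the `group_flakes` helper are all hypothetical names invented for illustration, and the only failure modes distinguished are "timeout" and "other".

```python
from collections import defaultdict
from typing import NamedTuple


class FlakeOccurrence(NamedTuple):
    """Hypothetical record for one flaky failure (not the real data model)."""
    step_name: str
    platform: str
    timed_out: bool


def group_flakes(occurrences):
    """Group occurrences by (step name, platform, failure mode)."""
    groups = defaultdict(list)
    for occ in occurrences:
        failure_mode = "timeout" if occ.timed_out else "other"
        groups[(occ.step_name, occ.platform, failure_mode)].append(occ)
    return dict(groups)


# Illustrative data only; platforms and outcomes are made up.
example = [
    FlakeOccurrence("webkit_tests (with patch)", "win", True),
    FlakeOccurrence("webkit_tests (with patch)", "win", True),
    FlakeOccurrence("webkit_tests (with patch)", "linux", False),
]
grouped = group_flakes(example)
```

With a key like this, the Windows timeouts would land in one group and the Linux failures in another, even though the step name is the same.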
 
Cc: -serg...@chromium.org
Status: Available (was: Unconfirmed)
We also break flakes down by test name when it's available (i.e. when the test launcher has uploaded a valid test results JSON file to test-results.appspot.com) and file a separate bug for each test. For flakes like that we include a link to the Flakiness Dashboard, which breaks flakes down by platform. We do not break them down by failure type yet, but that should be a separate bug filed against the Infra>Flakiness>Dashboard component.

Unfortunately, it is true that we do not do as good a job for steps. Right now we mostly focus on improving the tools that detect where most test flakiness is coming from (Flakiness Surface project), but perhaps we can do better in the future. Thanks for filing this bug, and if you are willing to contribute to Flakiness Pipeline, please let me know - I'll be glad to give you some pointers on where to start.
I've learned that some tests, e.g. V8 and GPU tests, run the same tests on different platforms or with different configs, and we need an additional way of distinguishing these runs from each other. The step name is usually insufficient, since we normalize it by dropping everything after the first space except the "(with patch)" suffix.
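
A rough sketch of the normalization described above shows why distinct configurations collapse into one group. This is an illustration written from the description in this comment, not the actual chromium-try-flakes code; the function name `normalize_step_name` and the example step names other than "webkit_tests (with patch)" are hypothetical.

```python
WITH_PATCH_SUFFIX = " (with patch)"


def normalize_step_name(step_name):
    """Drop everything after the first space, keeping a "(with patch)" suffix."""
    has_suffix = step_name.endswith(WITH_PATCH_SUFFIX)
    base = step_name[:-len(WITH_PATCH_SUFFIX)] if has_suffix else step_name
    # Keep only the text before the first space.
    base = base.split(" ", 1)[0]
    return base + WITH_PATCH_SUFFIX if has_suffix else base


# Two hypothetical steps that differ only in configuration end up identical.
a = normalize_step_name("some_tests on ConfigA (with patch)")
b = normalize_step_name("some_tests on ConfigB (with patch)")
```

Both `a` and `b` come out as "some_tests (with patch)", so any information about the platform or config carried in the step name is lost before flakes are grouped.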
Cc: estaab@chromium.org katthomas@chromium.org serg...@chromium.org
Issue 671874 has been merged into this issue.
I wrote a proposal that will help to address this issue: https://docs.google.com/document/d/1aCf915IjQX4DlOU8kiP51ORFIJGzCaAuacILWU9A1fA (sorry, internal only). I am not planning to work on this immediately since my work is still focused on Flakiness Surface, but I've decided to document my thoughts in this doc before I forget them, and in case someone else decides to work on this issue.
Blockedon: 704839
Project Member

Comment 6 by sheriffbot@chromium.org, Apr 25 2018

Labels: Hotlist-Recharge-Cold
Status: Untriaged (was: Available)
This issue has been Available for over a year. If it's no longer important or seems unlikely to be fixed, please consider closing it out. If it is important, please re-triage the issue.

Sorry for the inconvenience if the bug really should have been left as Available.

For more details visit https://www.chromium.org/issue-tracking/autotriage - Your friendly Sheriffbot
Cc: -serg...@chromium.org st...@chromium.org
