
Issue 755359

Starred by 1 user

Issue metadata

Status: Verified
Owner:
Closed: Aug 2017
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 1
Type: Bug

Blocking:
issue 756173




[Findit] Flake Analyzer - NextBuildNumberPipeline producing the same value over and over again.

Project Member Reported by wylieb@chromium.org, Aug 14 2017

Issue description

Comment 1 by wylieb@chromium.org, Aug 14 2017

Labels: -Pri-2 Pri-1
A good fix would be to check whether the build number returned by lookback is the same build number that NextBuildNumberPipeline received as an argument. If it is, abort the whole pipeline.
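
A rough sketch of that check in Python, for illustration only. The names `next_build_number_or_abort`, `lookback`, and `DuplicateBuildNumberError` are hypothetical and not taken from the Findit code; the real pipeline would abort through whatever mechanism it already uses rather than this exception.

```python
class DuplicateBuildNumberError(Exception):
    """Raised when lookback returns the build number it was given."""


def next_build_number_or_abort(current_build_number, lookback):
    """Return the next build number to analyze, or raise if it is unchanged.

    `lookback` is a hypothetical callable standing in for the lookback
    algorithm: it takes the current build number and returns the next one.
    """
    candidate = lookback(current_build_number)
    if candidate == current_build_number:
        # No progress was made, so rerunning would repeat the same analysis.
        raise DuplicateBuildNumberError(
            'lookback returned build %d again' % current_build_number)
    return candidate
```

The caller would treat `DuplicateBuildNumberError` as the signal to abort the whole pipeline instead of scheduling another run at the same build.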

Comment 3 by st...@chromium.org, Aug 16 2017

Cc: lijeffrey@chromium.org
Owner: wylieb@chromium.org
Status: Assigned (was: Untriaged)
I checked a lot of recent analyses of flaky tests and found the following:

1. A lot of analyses were still in RUNNING status even though they started a few days ago. However, the pipeline indicates that they are completed.
   Among the 100 analyses listed at the link below, 65 are still in RUNNING status, while 17 ran into errors.
   https://findit-for-me.appspot.com/waterfall/list-flakes?cursor=CuIBChkKDHJlcXVlc3RfdGltZRIJCNyYpqzVrNUCEsABag9zfmZpbmRpdC1mb3ItbWVyrAELEhdNYXN0ZXJGbGFrZUFuYWx5c2lzUm9vdCJ2Y2hyb21pdW0ubGludXgvTGludXggVGVzdHMvNTk2MjAvY29tcG9uZW50c191bml0dGVzdHMvUm1sbGJHUlVjbWxoYkhOUWNtOTJhV1JsY2xSbGMzUXVVSEp2ZG1sa1pWTjViblJvWlhScFkxUnlhV0ZzY3c9PQwLEhNNYXN0ZXJGbGFrZUFuYWx5c2lzGAUMGAAgAQ==&direction=next

2. The "infinite loop" itself stops after ~160 instances of the RecursiveFlakePipeline. The analysis Brandon gave above is a good example. A more recent one is https://findit-for-me.appspot.com/waterfall/flake?key=ag9zfmZpbmRpdC1mb3ItbWVynwELEhdNYXN0ZXJGbGFrZUFuYWx5c2lzUm9vdCJpY2hyb21pdW0ud2luL1dpbjcgVGVzdHMgKGRiZykoMSkvNjIzMzIvYnJvd3Nlcl90ZXN0cy9VMlZ6YzJsdmJsSmxjM1J2Y21WVVpYTjBMbEpsYzNSdmNtVlhaV0pWU1ZObGRIUnBibWR6DAsSE01hc3RlckZsYWtlQW5hbHlzaXMYAQw
   When it had just started, it looked like this: https://screenshot.googleplex.com/VNUhV7vzemT.png
   Now it looks like this: https://screenshot.googleplex.com/h6vB4ZTbNT5.png
   https://findit-for-me.appspot.com/_ah/pipeline/status?root=0d9242e38eb5453087c8cdf321ddff0b&auto=false#pipeline-9ec735c4d228482ca845713073347e06

3. The "infinite loop" happens when Flake Analyzer tries to rerun at the same build point, one at which the test is stable instead of flaky.


Besides this specific bug, I'm surprised that our analysis pipeline is so unreliable.
It seems better to fix this bigger issue before adding more features to Flake Analyzer.

Brandon, since you have already looked into this bug, I'm assigning it to you to follow up. Please also file a meta bug for stabilizing Flake Analyzer, with sub-bugs for the breakdown tasks you can identify.

Comment 4 by st...@chromium.org, Aug 16 2017

Update to point 2 in comment #3: 160 seems too low for the sample I gave there. It is still running, and there have already been 200+ instances of RecursiveFlakePipeline (Children: 985 / 1182 done).
https://findit-for-me.appspot.com/_ah/pipeline/status?root=0d9242e38eb5453087c8cdf321ddff0b&auto=false#pipeline-63586d7e541f4222993b3edd522dd7e6



Comment 5 by wylieb@chromium.org, Aug 16 2017

Blocking: 756173

Comment 6 by wylieb@chromium.org, Aug 16 2017

Looks like the root cause of this is: https://bugs.chromium.org/p/chromium/issues/detail?id=756214

Comment 7 by wylieb@chromium.org, Aug 18 2017

Status: Verified (was: Assigned)
