
Issue 786028


Issue metadata

Status: Fixed
Owner:
Closed: Nov 2017
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 1
Type: Bug

Blocking:
issue 788820




[Findit] Flake Analyzer - New task requests to swarming can fail silently

Project Member Reported by wylieb@chromium.org, Nov 16 2017

Issue description

In some cases swarming returns an error, but we assume the request succeeded and continue anyway.

Analysis: 
https://findit-for-me.appspot.com/waterfall/flake?key=ag9zfmZpbmRpdC1mb3ItbWVy6AELEhdNYXN0ZXJGbGFrZUFuYWx5c2lzUm9vdCKxAWNocm9taXVtLm1lbW9yeS9MaW51eCBDaHJvbWl1bSBPUyBBU2FuIExTYW4gVGVzdHMgKDEpLzI0NzUwL2NvbnRlbnRfYnJvd3NlcnRlc3RzL1UybDBaVkJsY2xCeWIyTmxjM05DY205M2MyVnlWR1Z6ZEM1RFlXNWpaV3hYYUdWbGJGTmpjbTlzYkVKMVltSnNhVzVuVDI1WGFHVmxiRlJoY21kbGRFUmxiR1YwYVc5dQwLEhNNYXN0ZXJGbGFrZUFuYWx5c2lzGAEM

Status:
https://findit-for-me.appspot.com/_ah/pipeline/status?root=1c8f93a31e784b90a84307ca5caa87a7&auto=false#pipeline-bc4de46a49c8423d90d83eefa99e894c

LOG:
2017-11-16 06:37:06.388 PST
got response status 200 for url https://chromium-swarm.appspot.com/_ah/api/swarming/v1/task/39dcebf1c312d010/request (/base/data/home/apps/s~findit-for-me/waterfall-backend:12746-fb6e95f.405435860884942934/libs/http/interceptor.py:123)


2017-11-16 06:37:06.410 PST
got exception <class 'google.appengine.api.urlfetch_errors.ConnectionClosedError'>("Connection closed unexpectedly by server at URL: https://chromium-swarm.appspot.com/_ah/api/swarming/v1/tasks/new") for url https://chromium-swarm.appspot.com/_ah/api/swarming/v1/tasks/new (/base/data/home/apps/s~findit-for-me/waterfall-backend:12746-fb6e95f.405435860884942934/libs/http/interceptor.py:130)

2017-11-16 06:37:06.411 PST
Retrying connection to https://chromium-swarm.appspot.com/_ah/api/swarming/v1/tasks/new in 60 seconds (/base/data/home/apps/s~findit-for-me/waterfall-backend:12746-fb6e95f.405435860884942934/waterfall/swarming_util.py:181)

2017-11-16 06:38:06.701 PST
got response status 200 for url https://chromium-swarm.appspot.com/_ah/api/swarming/v1/tasks/new

This pipeline returned a task_id of 'no task - exception', but nothing reported the failure. The whole stack fails silently.
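One way to make this kind of failure loud is to validate the task id before anything downstream uses it. The sketch below is only an illustration, not Findit's actual code: `require_task_id`, `SwarmingTriggerError`, and the `NO_TASK_PREFIX` check are hypothetical names, and the prefix is an assumption based on the single 'no task - exception' value seen above.

```python
# Hypothetical sentinel prefix. The issue only shows 'no task - exception';
# treating every id that starts with 'no task' as a sentinel is an assumption.
NO_TASK_PREFIX = 'no task'


class SwarmingTriggerError(Exception):
    """Raised when triggering did not produce a usable swarming task id."""


def require_task_id(task_id):
    """Return task_id unchanged, or raise if it is missing or a sentinel."""
    if not task_id or task_id.lower().startswith(NO_TASK_PREFIX):
        raise SwarmingTriggerError(
            'No swarming task was created: %r' % (task_id,))
    return task_id
```

Calling `require_task_id('no task - exception')` raises instead of letting the caller continue with a placeholder id, while a real id passes through unchanged.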
 

Comment 1 by wylieb@chromium.org, Nov 16 2017

Labels: -Pri-3 Pri-1

Comment 2 by st...@chromium.org, Nov 16 2017

Status: Assigned (was: Untriaged)

Comment 3 by wylieb@chromium.org, Nov 21 2017

Summary: [Findit] Flake Analyzer - New task requests to swarming can fail silently (was: [Findit] New task requests to swarming can fail silently)
The root cause appears to be that when a build number is determined to be invalid at triggering time (for example, because it can't be compiled), a placeholder task id of "no task" is returned instead of an error. This bubbles up to UpdateFlakeSwarmingTaskDataPointsPipeline.
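To illustrate the idea (this is a rough sketch, not the change that fixed this bug), the trigger step could return an explicit result that carries either a task id or an error, so the consumer cannot mistake a placeholder string for a real task. All names here (`TriggerResult`, `trigger_flake_task`, `update_data_points`, and the `is_valid_build` / `create_task` callables) are hypothetical stand-ins for the real pipeline code.

```python
from collections import namedtuple

# Hypothetical result type: exactly one of task_id / error is set.
TriggerResult = namedtuple('TriggerResult', ['task_id', 'error'])


def trigger_flake_task(build_number, is_valid_build, create_task):
    """Trigger a task, reporting an invalid build as an explicit error."""
    if not is_valid_build(build_number):
        return TriggerResult(
            task_id=None, error='invalid build %d' % build_number)
    return TriggerResult(task_id=create_task(build_number), error=None)


def update_data_points(result, data_points):
    """Record the task id, or fail loudly when triggering reported an error."""
    if result.error is not None:
        raise ValueError('Cannot record data point: %s' % result.error)
    data_points.append(result.task_id)
    return data_points
```

With this shape, an invalid build surfaces as an exception at the point where the data point would be recorded, rather than flowing through as a string that looks like a task id.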

Comment 5 by wylieb@chromium.org, Nov 27 2017

Blocking: 788820

Comment 6 by wylieb@chromium.org, Nov 29 2017

Status: Fixed (was: Assigned)
