New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 671592 link

Starred by 1 user

Issue metadata

Status: Fixed
Owner:
Last visit > 30 days ago
Closed: May 2017
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 1
Type: Bug



Sign in to add a comment

"isolate" is flaky

Project Member Reported by chromium...@appspot.gserviceaccount.com, Dec 6 2016

Issue description

"isolate" is flaky.

This issue was created automatically by the chromium-try-flakes app. Please find the right owner to fix the respective test/step and assign this issue to them. If the step/test is infrastructure-related, please add Infra-Troopers label and change issue status to Untriaged. When done, please remove the issue from Sheriff Bug Queue by removing the Sheriff-Chromium label.

We have detected 3 recent flakes. List of all flakes can be found at https://chromium-try-flakes.appspot.com/all_flake_occurrences?key=ahVzfmNocm9taXVtLXRyeS1mbGFrZXNyEgsSBUZsYWtlIgdpc29sYXRlDA.



This flaky test/step was previously tracked in  issue 669391 .
 
Labels: -Sheriff-Chromium Infra-Troopers
There have been three errors.  All seem to be trying to push different files to isolate server and failing with 500 error from isolate server.  I don't believe this is the same issue as 669391.

[81116] [23346/6.77Gib/23346] [23346/23346] [85/5.10Gib/86/5.31Gib] 4m12s
18:29:52.584374 doPush(64737e0f83daa2c849c18bb5453ae52e4fb81f28) failed: http request failed: Internal Server Error (HTTP 500)
&isolatedclient.PushState{status:isolateservice.HandlersEndpointsV1PreuploadStatus{GsUploadUrl:"https://isolateserver.storage.googleapis.com/default-gzip/64737e0f83daa2c849c18bb5453ae52e4fb81f28?GoogleAccessId=553957813421%40developer.gserviceaccount.com&Expires=1481005155&Signature=JwDxhC3WS4aO40JMX6hSFgPINlnHdH1wlZ%2BLft4xnglOpgjlkMxoeAcGVtlv6YpHyV54haJHeql2Fghe040CjIr58cgELUekLCqCr2BM%2BwY1smlQRh3SmtJURQOa4F3SA6uzE8SzlvbY%2F6BUzABzG736wr5IX6cbhEZKT0POob0%3D", Index:0, UploadTicket:"AXsiX2kiOiIxNDgwOTkwNzU1MDkxIiwiZCI6IjY0NzM3ZTBmODNkYWEyYzg0OWMxOGJiNTQ1M2FlNTJlNGZiODFmMjgiLCJpIjoiMSIsIm4iOiJkZWZhdWx0LWd6aXAiLCJzIjoiMjI3MTM1NDg4In1ccYjLjd5XMkGpIx77K-UZc7DOpLG_sLc48dyv5PqEfg", ForceSendFields:[]string(nil), NullFields:[]string(nil)}, digest:"64737e0f83daa2c849c18bb5453ae52e4fb81f28", size:227135488, uploaded:false, finalized:false}
18:29:52.584374 Uploaded 216.6Mib: out\Release_x64\app_shell_unittests.exe.pdb
Hits    : 23260 (1.46Gib)
Misses  :    86 (5.31Gib)
Duration: 10m39.438s
isolate: push(E:\b\c\b\win\src\out\Release_x64\app_shell_unittests.exe.pdb) failed: http request failed: Internal Server Error (HTTP 500)


[110160] [29646/9.07Gib/29646] [29646/29646] [214/7.13Gib/215/7.13Gib] 2m44s
18:33:59.126177 doPush(e748cbd1f4307da0d2e5a38f3542bc15e3707ecc) failed: http request failed: Internal Server Error (HTTP 500)
&isolatedclient.PushState{status:isolateservice.HandlersEndpointsV1PreuploadStatus{GsUploadUrl:"https://isolateserver.storage.googleapis.com/default-gzip/e748cbd1f4307da0d2e5a38f3542bc15e3707ecc?GoogleAccessId=553957813421%40developer.gserviceaccount.com&Expires=1481005581&Signature=h%2BnUzNF5BzQjGMEeL0WNUpFI%2B%2F3QIbL2k%2FKsQq1eK165hWJPuyHRqzFmXNX0AA1Fpi4XcT4AZkYzDevbDBkFMFjm0YEU9XcWaZNJ99j6K%2B4VR3CVfRpJ99Bu7e4EraQ2v3LfJj%2Ftf53AEXK%2FzTMtHZ5K8zBDS3dUsduq4CH0R%2BE%3D", Index:2, UploadTicket:"AXsiX2kiOiIxNDgwOTkxMTgxMjMxIiwiZCI6ImU3NDhjYmQxZjQzMDdkYTBkMmU1YTM4ZjM1NDJiYzE1ZTM3MDdlY2MiLCJpIjoiMSIsIm4iOiJkZWZhdWx0LWd6aXAiLCJzIjoiMTk0NTYwMCJ9hbKRLGCdtoUp_aamBBA-vQUNpn__NdLAOBBn_sQ0Vh4", ForceSendFields:[]string(nil), NullFields:[]string(nil)}, digest:"e748cbd1f4307da0d2e5a38f3542bc15e3707ecc", size:1945600, uploaded:false, finalized:false}
18:33:59.126177 Uploaded 1.86Mib: out\Release_x64\battor_agent_unittests.exe
Hits    : 29431 (1.93Gib)
Misses  :   215 (7.13Gib)
Duration: 7m56.097s
isolate: push(E:\b\c\b\win\src\out\Release_x64\battor_agent_unittests.exe) failed: http request failed: Internal Server Error (HTTP 500)


[110174] [29654/9.09Gib/29654] [29654/29654] [318/7.69Gib/319/7.69Gib] 7m46s
20:53:36.136712 doPush(4800bcadb3376668c73145ff2f8dbd362adca070) failed: http request failed: Service Unavailable (HTTP 503)
&isolatedclient.PushState{status:isolateservice.HandlersEndpointsV1PreuploadStatus{GsUploadUrl:"https://isolateserver.storage.googleapis.com/default-gzip/4800bcadb3376668c73145ff2f8dbd362adca070?GoogleAccessId=553957813421%40developer.gserviceaccount.com&Expires=1481013643&Signature=zlsFBRendxQf7ReMcXostZeezWJX1VA4ZFlHolCn0b2SARd0bDSlzb6MR1PulvFuGfB6QH1KRNGQ4bIvsvXXpKCnSJdSXlLBfW7pDnkbWEMt9Km9qWEF3oli%2BJcObcI2mSTvNNfzFZByoWgUyR4qxd6mfkJJ%2FCAbpcfP%2BNqr2bg%3D", Index:1, UploadTicket:"AXsiX2kiOiIxNDgwOTk5MjQzNTQzIiwiZCI6IjQ4MDBiY2FkYjMzNzY2NjhjNzMxNDVmZjJmOGRiZDM2MmFkY2EwNzAiLCJpIjoiMSIsIm4iOiJkZWZhdWx0LWd6aXAiLCJzIjoiNjI1NDU5MiJ9TMx9uSwpQT7piFJxZUewfNLAq7X3OjvphA6ddKU_u_s", ForceSendFields:[]string(nil), NullFields:[]string(nil)}, digest:"4800bcadb3376668c73145ff2f8dbd362adca070", size:6254592, uploaded:false, finalized:false}
20:53:36.136712 Uploaded 5.96Mib: out\Release_x64\scoped_handle_test_dll.dll.pdb
Hits    : 29335 (1.39Gib)
Misses  :   319 (7.69Gib)
Duration: 13m3.604s
isolate: push(E:\b\c\b\win\src\out\Release_x64\scoped_handle_test_dll.dll.pdb) failed: http request failed: Service Unavailable (HTTP 503)

Cc: vadimsh@chromium.org
Owner: mar...@chromium.org
Status: Assigned (was: Untriaged)
Cc: mcgreevy@chromium.org mar...@chromium.org tansell@chromium.org
Owner: djd@chromium.org
Dave/Michael, want to dig deeper? That's related to the isolate go client.

Comment 4 by djd@chromium.org, Dec 6 2016

That looks like the GCS service returning the 500s. That's definitely not surprising (it's known to be flaky on uploads), but my reading of the isolate cmd is that it will retry on 500 errors. I'll have to look deeper.
My only concern is that I thought the automatic retry should have kicked in. :/

Comment 6 by djd@chromium.org, Dec 7 2016

Status: Started (was: Assigned)
A cursory look at this is that (at least from archive) we're passing in a nil retry.Factory [1] which prevents retries [2]

[1] https://github.com/luci/luci-go/blob/master/client/cmd/isolate/archive.go#L68
[2] https://godoc.org/github.com/luci/luci-go/common/retry#Factory
</shamecube>
Project Member

Comment 8 by bugdroid1@chromium.org, Dec 13 2016

The following revision refers to this bug:
  https://chromium.googlesource.com/external/github.com/luci/luci-go.git/+/64707122e72f7405ce9b04b38323cf8bbf7cbf2d

commit 64707122e72f7405ce9b04b38323cf8bbf7cbf2d
Author: djd <djd@chromium.org>
Date: Tue Dec 13 02:22:05 2016

common/lhttp: close the Response.Body on non-200s

Refactor lhttp NewRequest to make sure that the response's body is
always drained/closed in the case that the handler is not invoked (for
5xx responses, etc.).

Add text to the returned error from NewRequest to make it how many
attempts were attempted before failing.

BUG= 671592 

Review-Url: https://codereview.chromium.org/2562293002

[modify] https://crrev.com/64707122e72f7405ce9b04b38323cf8bbf7cbf2d/common/lhttp/client.go
[modify] https://crrev.com/64707122e72f7405ce9b04b38323cf8bbf7cbf2d/common/lhttp/client_test.go

Project Member

Comment 9 by bugdroid1@chromium.org, Dec 13 2016

The following revision refers to this bug:
  https://chromium.googlesource.com/infra/infra.git/+/f84d8594c980fc2ed8a7cedf87bba9344f570afe

commit f84d8594c980fc2ed8a7cedf87bba9344f570afe
Author: Dave Day <djd@chromium.org>
Date: Tue Dec 13 03:41:00 2016

Roll luci-go

Rolling forward to @0451078 to pick up latest exparchive changes and fixes to
lhttp retry logic/logging.

BUG= 598990 , 671592 

Change-Id: I9540c694bb69994a58e8836d72d425c61a919b44
Reviewed-on: https://chromium-review.googlesource.com/418103
Reviewed-by: Dave Sansome <dsansome@chromium.org>
Commit-Queue: Dave D <djd@chromium.org>

[modify] https://crrev.com/f84d8594c980fc2ed8a7cedf87bba9344f570afe/DEPS

Project Member

Comment 10 by chromium...@appspot.gserviceaccount.com, Dec 14 2016

Detected 18 new flakes for test/step "isolate". To see the actual flakes, please visit https://chromium-try-flakes.appspot.com/all_flake_occurrences?key=ahVzfmNocm9taXVtLXRyeS1mbGFrZXNyEgsSBUZsYWtlIgdpc29sYXRlDA. This message was posted automatically by the chromium-try-flakes app.

Comment 12 by djd@chromium.org, Dec 19 2016

The latest version of isolate has additional logging to try to work out why these uploads are failing.

Comment 13 by djd@chromium.org, Dec 28 2016

Owner: mcgreevy@chromium.org
Project Member

Comment 14 by chromium...@appspot.gserviceaccount.com, Mar 1 2017

Detected 3 new flakes for test/step "isolate". To see the actual flakes, please visit https://chromium-try-flakes.appspot.com/all_flake_occurrences?key=ahVzfmNocm9taXVtLXRyeS1mbGFrZXNyEgsSBUZsYWtlIgdpc29sYXRlDA. This message was posted automatically by the chromium-try-flakes app.
Ignore flakes today as it is due to issue 697275.
Project Member

Comment 16 by chromium...@appspot.gserviceaccount.com, Mar 24 2017

Detected 7 new flakes for test/step "isolate". To see the actual flakes, please visit https://chromium-try-flakes.appspot.com/all_flake_occurrences?key=ahVzfmNocm9taXVtLXRyeS1mbGFrZXNyEgsSBUZsYWtlIgdpc29sYXRlDA. This message was posted automatically by the chromium-try-flakes app.
Status: Fixed (was: Started)
We haven't see any new flakes since Mar 24th. I'm going to close this bug for now.

Sign in to add a comment