browser_tests flaky times out on Mac10.12 Tests |
||||||||||
Issue descriptionhttps://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/3095 https://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/3079 Not sure whose job this is, but it's purple on the bots, which I think means it goes to the trooper.
,
Jul 28 2017
I had a look over recent purple Mac10.12 runs, and the failures seemed to be different.. e.g. this one had several BOT_DIED and one regular failure in browser_side_navigation_browser_tests https://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/3218 and this build also had a bunch of BOT_DIED failure: https://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/3220 whereas this one looks like the test segfaulted. Perhaps that shouldn't cause a purple bot failure, but rather a red one? https://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/3215
,
Aug 8 2017
It does look like the test suite just gave up and decided to time out. It's purple but probably shouldn't be purple. The test suite should be doing it's own timeout before swarming kills it. Assigning over to Test, whom I assume maintain the test runner?
,
Aug 8 2017
This looks frequently happening (almost once per day or more), so let me bump this up to P1. I looked the following failures to investigate why the test gets stuck so often, but I'm still not sure. The first tests on which browser_tests started to get stuck are different as mentioned at c#2. - https://chromium-swarm.appspot.com/task?id=37caba4fc918da10&refresh=10&show_raw=1 EncryptedMediaSupportedTypesExternalClearKeyTest.InvalidKeySystems - https://chromium-swarm.appspot.com/task?id=37d50fa464b6d610&refresh=10&show_raw=1 MSE_ExternalClearKey/EncryptedMediaTest.Playback_VideoOnly_MP4_VP9/0 - https://chromium-swarm.appspot.com/task?id=37d617b49a5a7010&refresh=10&show_raw=1 MediaEngagementAutoplayBrowserTest.DoNotBypassAutoplayFrameLowEngagement - https://chromium-swarm.appspot.com/task?id=37d886a04bd92610&refresh=10&show_raw=1 IncognitoProfileMainNetworkContext/NetworkContextConfigurationBrowserTest.Cache/0 - https://chromium-swarm.appspot.com/task?id=378f0da96891a110&refresh=10&show_raw=1 NoStatePrefetchBrowserTest.PrerenderSafeBrowsingTopLevel - https://chromium-swarm.appspot.com/task?id=3784bc106334fa10&refresh=10&show_raw=1 MediaStreamPermissionTest.TestDenyingUserMedia
,
Aug 8 2017
Latest one was SubresourceFilterWebSocketBrowserTest.DoNotBlockWebSocketNoActivatedFrame/0 @ https://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/3630 The one thing that seems common with all of these is ui_test_utils::NavigateToURLWithDispositionBlockUntilNavigationsComplete() which performs a nested RunLoop blocking until the navigation completes [1]. So navigation response isn't arriving. Hard to tell what the underlying issue is. Maybe a full dump would help? +jam@: PlzNavigate? [1] Or one of them is in content::TestURLLoaderClient::RunUntilResponseReceived() but same thing -- blocked on navigation response.
,
Aug 8 2017
This is happening for both plznavigate and non plznavigate (i.e. last 2 links in comment 4).
,
Aug 9 2017
This error is reminiscent of the behavior when the window server crashes, although the symptoms seem fairly different. https://bugs.chromium.org/p/chromium/issues/detail?id=653353 https://bugs.chromium.org/p/chromium/issues/detail?id=515627 Crashing in 3 ??? 0x00007fff5a0bded8 0x0 + 140734704115416 4 CoreFoundation 0x00007fffb5a63e84 __CFRunLoopServiceMachPort + 212 Seems fairly worrying. I wonder if this is because many of the bots are on 10.12.2, which is more buggy/less stable than 10.12.6?
,
Aug 14 2017
Still happening about once per day. Different test each time. Adding a third option next to ui_test_utils::NavigateToURLWithDispositionBlockUntilNavigationsComplete() and content::TestURLLoaderClient::RunUntilResponseReceived(): content::TitleWatcher::WaitAndGetTitle(). Happened here: https://chromium-swarm.appspot.com/task?id=37f76797e15fc810&refresh=10&show_raw=1 It also runs a nested RunLoop essentially waiting for a navigation, so probably the same thing as well.
,
Aug 14 2017
[MacTriage]
,
Aug 18 2017
Looking at https://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/ I can see that the last failure like this was https://luci-milo.appspot.com/buildbot/chromium.mac/Mac10.12%20Tests/3879 @ 2017-08-15 7:35 AM (CEST) That's more than 3 days ago. Shall we consider it fixed then?
,
Aug 18 2017
Lowering priority since it's currently not a big issue.
,
Aug 22 2017
For the past 200 runs it has been purple a few times but the logs look different, not sure if it's the same bug or not. Swarming times out.
,
Aug 24 2017
Removing the sheriff label as this doesn't seem to be an urgent issue or blocking others.
,
Aug 24
This issue has been Available for over a year. If it's no longer important or seems unlikely to be fixed, please consider closing it out. If it is important, please re-triage the issue. Sorry for the inconvenience if the bug really should have been left as Available. For more details visit https://www.chromium.org/issue-tracking/autotriage - Your friendly Sheriffbot
,
Aug 31
|
||||||||||
►
Sign in to add a comment |
||||||||||
Comment 1 by bpastene@chromium.org
, Jul 25 2017Labels: -Infra-Troopers Sheriff-Chromium OS-Mac