False rejects caused by webgl_conformance_vulkan_passthrough_tests on Windows |
|||||
Issue descriptionCL: https://chromium-review.googlesource.com/c/chromium/src/+/1386127/2 Failed build: https://ci.chromium.org/p/chromium/builders/luci.chromium.try/win_optional_gpu_tests_rel/12575 Succeeded on retry: https://ci.chromium.org/p/chromium/builders/luci.chromium.try/win_optional_gpu_tests_rel/12580 Failing task: https://bugs.chromium.org/p/chromium/issues/entry Looks like driver issues? """ [454/469] gpu_tests.webgl_conformance_integration_test.WebGLConformanceIntegrationTest.WebglExtension_OES_texture_half_float_linear passed 1.9700s [6300:7528:1220/030721.590:ERROR:gl_surface_egl.cc(537)] EGL Driver message (Error) eglCreateWindowSurface: Internal Vulkan error: Initialization of an object could not be completed for implementation-specific reasons., in ../../third_party/angle/src/libANGLE/renderer/vulkan/SurfaceVk.cpp, initializeImpl:467. [6300:7528:1220/030721.590:ERROR:gl_surface_egl.cc(1057)] eglCreateWindowSurface failed with error EGL_BAD_SURFACE [6300:7528:1220/030721.590:ERROR:in_process_command_buffer.cc(482)] ContextResult::kSurfaceFailure: Failed to create surface. [6300:3008:1220/030721.590:ERROR:viz_process_context_provider.cc(220)] Failed to initialize InProcessCommmandBuffer [1852:1096:1220/030721.590:ERROR:viz_process_transport_factory.cc(308)] Switching to software compositing. [6300:7528:1220/030721.593:WARNING:ipc_message_attachment_set.cc(49)] MessageAttachmentSet destroyed with unconsumed attachments: 0/1 [455/469] gpu_tests.webgl_conformance_integration_test.WebGLConformanceIntegrationTest.WebglExtension_OES_vertex_array_object passed 0.3750s DevTools console [network]: Failed to load resource: the server responded with a status of 404 (File not found) http://127.0.0.1:58399/favicon.ico [456/469] gpu_tests.webgl_conformance_integration_test.WebGLConformanceIntegrationTest.WebglExtension_TestCoverage passed 0.0940s [1852:1096:1220/030721.999:INFO:CONSOLE(197)] "WEBGL_color_buffer_float was not available", source: (197) Expected exception while running WebglExtension_WEBGL_color_buffer_float Traceback (most recent call last): _RunGpuTest at content\test\gpu\gpu_tests\gpu_integration_test.py:155 self.RunActualGpuTest(url, *args) RunActualGpuTest at content\test\gpu\gpu_tests\webgl_conformance_integration_test.py:199 getattr(self, test_name)(test_path, *args[1:]) _RunExtensionTest at content\test\gpu\gpu_tests\webgl_conformance_integration_test.py:320 self._CheckTestCompletion() _CheckTestCompletion at content\test\gpu\gpu_tests\webgl_conformance_integration_test.py:284 self.fail(self._WebGLTestMessages(self.tab)) fail at .swarming_module\bin\Lib\unittest\case.py:410 raise self.failureException(msg) AssertionError: WEBGL_color_buffer_float was not available Locals: msg : u'WEBGL_color_buffer_float was not available\n' """
,
Dec 20
,
Dec 20
Ken, looking at https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Win10%20FYI%20Release%20%28NVIDIA%29?limit=200 I see 4 Vulkan specific failures. It makes sense there would be fewer failures than win_optional_gpu_tests_rel because of test de-duplication. Also looking at the original failure for this bug I see it seemed to have a random problem initializing the rendering context for Vulkan: [4676:5632:1220/030356.681:ERROR:gl_surface_egl.cc(537)] EGL Driver message (Error) eglCreateWindowSurface: Internal Vulkan error: Initialization of an object could not be completed for implementation-specific reasons., in ../../third_party/angle/src/libANGLE/renderer/vulkan/SurfaceVk.cpp, initializeImpl:467. [4676:5632:1220/030356.681:ERROR:gl_surface_egl.cc(1057)] eglCreateWindowSurface failed with error EGL_BAD_SURFACE [4676:5632:1220/030357.258:ERROR:in_process_command_buffer.cc(482)] ContextResult::kSurfaceFailure: Failed to create surface. Looking at https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Win10%20FYI%20Exp%20Release%20%28NVIDIA%29?limit=200 however I see zero failures. Several other failures seem to be due to lost device in the Vulkan back-end. See issue 907544. That makes me think this issue and issue 907544 will be solved by the NVIDIA driver upgrade in issue 887241. Cc'ing Adam who was helping with the driver upgrad.e
,
Dec 21
Can we try to mitigate this problem somehow until the driver update's done? Could you add a blanket Flaky expectation for "*" to webgl_conformance_expectations.py for ['win', 'nvidia', 'passthrough', 'vulkan']? We need to stamp out flakiness on the trybot because it blocks other Chromium developers. Thanks.
,
Dec 21
Yeah. I'm traveling and will be taking time off. If I don't get to it please just TBR a total removal of the vulkan WebGL tests to myself or Geoff.
,
Dec 21
The following revision refers to this bug: https://chromium.googlesource.com/chromium/src.git/+/8837121fe4fc086eb66e50e46f4546528631fcf5 commit 8837121fe4fc086eb66e50e46f4546528631fcf5 Author: Jamie Madill <jmadill@chromium.org> Date: Fri Dec 21 15:50:22 2018 Mark Vulkan WebGL tests as flaky. Until the drivers can be upgraded. Bug: 917046 Tbr: kbr@chromium.org Change-Id: I7696701e71dbef72159ff8eab392729e3326037e Reviewed-on: https://chromium-review.googlesource.com/c/1388545 Reviewed-by: Jamie Madill <jmadill@chromium.org> Commit-Queue: Jamie Madill <jmadill@chromium.org> Cr-Commit-Position: refs/heads/master@{#618538} [modify] https://crrev.com/8837121fe4fc086eb66e50e46f4546528631fcf5/content/test/gpu/gpu_tests/webgl_conformance_expectations.py
,
Dec 21
What's the point of marking the tests as flaky rather than removing them via Skip()? [since we intend to ignore the results] Right now, the tests can still timeout, causing builds to fail even though they're marked as flaky in webgl_conformance_expectations, right?
,
Dec 21
The results are not ignored. In the Telemetry-based GPU test suites, a Flaky expectation means "run the test up to three times, and if it fails all three times, report a failure". If Jamie's flaky expectation for these tests doesn't result in them running reliably then we'll temporarily take them off this bot.
,
Jan 11
The following revision refers to this bug: https://chromium.googlesource.com/chromium/src.git/+/187f4c9551bac0c7b38047d41772f262fc4267d3 commit 187f4c9551bac0c7b38047d41772f262fc4267d3 Author: Jamie Madill <jmadill@chromium.org> Date: Fri Jan 11 20:31:08 2019 Remove flaky Win/Vulkan/NV WebGL suppression. The drivers have been upgraded. Try running the test suites again to see if the device lost problems have gone away. Bug: 917046 Change-Id: I8a8b92e5cdb0f26f7b6f502cbd99c52308767ecd Tbr: kbr@chromium.org Reviewed-on: https://chromium-review.googlesource.com/c/1407089 Reviewed-by: Jamie Madill <jmadill@chromium.org> Commit-Queue: Jamie Madill <jmadill@chromium.org> Cr-Commit-Position: refs/heads/master@{#622124} [modify] https://crrev.com/187f4c9551bac0c7b38047d41772f262fc4267d3/content/test/gpu/gpu_tests/webgl_conformance_expectations.py
,
Jan 14
Don't see any Vulkan flakes in the last 200 builds in https://ci.chromium.org/p/chromium/builders/luci.chromium.ci/Win10%20FYI%20Release%20%28NVIDIA%29?limit=200 Tentatively closing this out. Will rearrange the bug tags to keep the issues associated with the driver upgrade. <3 labs team for the upgrade.
,
Jan 15
Fantastic work Jamie driving this to completion! |
|||||
►
Sign in to add a comment |
|||||
Comment 1 by kbr@chromium.org
, Dec 20Owner: jmad...@chromium.org
Status: Assigned (was: Untriaged)