Crashes aren't symbolized on mac perf bots |
||||||
Issue descriptionCrashes don't seem to get symbolized on mac perf bots. For example: https://build.chromium.org/p/chromium.perf/builders/Mac%2010.11%20Perf%20%284%29/builds/3879/steps/v8.browsing_desktop/logs/stdio/text The stack trace just has numeric addresses: Stack Trace: ******************************************************************************** Operating system: Mac OS X 10.11.1 15B42 CPU: amd64 family 6 model 58 stepping 9 8 CPUs GPU: UNKNOWN Crash reason: EXC_BAD_ACCESS / EXC_I386_GPFLT Crash address: 0x10df6422c Process uptime: 31 seconds Thread 8 (crashed) 0 Google Chrome Framework!ChromeMain + 0x129f7c rax = 0x00007f8cd608a980 rdx = 0x0000000000002821 rcx = 0x0002000000008365 rbx = 0x00007f8cd608a980 rsi = 0x0000000000014760 rdi = 0x00007f8cd26eac10 rbp = 0x000070000322e9e0 rsp = 0x000070000322e9d0 r8 = 0x00007f8cd26eac30 r9 = 0x0000000000000001 r10 = 0x0000000000000940 r11 = 0x0000000000001000 r12 = 0x000000010df656b0 r13 = 0x00007f8cd382c400 r14 = 0x00007f8cd26eac20 r15 = 0x00007f8cd2216ff0 rip = 0x000000010df6422c Found by: given as instruction pointer in context 1 Google Chrome Framework!ChromeMain + 0x12b412 rbx = 0x00007f8cd2216ff0 rbp = 0x000070000322ea00 rsp = 0x000070000322e9f0 r12 = 0x000000010df656b0 r13 = 0x00007f8cd382c400 r14 = 0x0000000125b8f7a0 r15 = 0x00007f8cd2216ff0 rip = 0x000000010df656c2 Found by: call frame info ...
,
Aug 10 2016
Oh, it looks like the unittest is currently flaky: issue 633761 :-(
,
Aug 10 2016
Probably why the test is flaky, there is obviously an issue with it being symbolized correctly. So maybe the test is failing correctly! I will investigate further.
,
Aug 10 2016
So if you look at the link to the logs above: https://build.chromium.org/p/chromium.perf/builders/Mac%2010.11%20Perf%20%284%29/builds/3879/steps/v8.browsing_desktop/logs/stdio/text and the link in issue 633761 for the flaky test asserting we have stack traces: https://build.chromium.org/p/chromium.mac/builders/Mac10.10%20Tests/builds/7722/steps/telemetry_perf_unittests/logs/stdio and there is common output right before the stack trace (or lack there of) is dumped out: 2016-08-10 05:03:16.235 Google Chrome[12693:100439] NSWindow warning: adding an unknown subview: <FullSizeContentView: 0x7fdf5a7c3640>. Break on NSLog to debug. 2016-08-10 05:03:16.235 Google Chrome[12693:100439] Call stack: ( "+callStackSymbols disabled for performance reasons" ) Maybe this is completely unrelated as it has to do with chrome, but I thought it was worth investigating. It could be generated from this file within chrome: https://cs.chromium.org/chromium/src/chrome/browser/ui/cocoa/full_size_content_window.mm but I haven't dug through it or tried to understand it enough to know if it is related. Either one of two things is going on here which would help in my path to debugging. Since it is failing similarly on both the perf waterfall and on the chromium waterfall (and by failing I mean failing to produce an accurate stack trace that can be symbolized although they may be failing differently) I have to assume one of two things: 1) There is something specific to how chrome is interacting with mac and there is a mac setting or issue where it is not producing the stack trace in all cases. I have cc'd erikchen@ as I was told he was a mac expert. 2) Or there is something that telemetry is not aware of with regards to mac and the actual telemetry code is inaccurately trying to obtain the stack traces. If #1 reproducing the error on a mac and trying to understand the mac infrastructure might help me debug this easier and with #2 it might be easier to look at the telemetry code and try gathering more information on the unittest failures. This could go back to adding more utilities besides screenshots to obtain system data at the time of crash.
,
Aug 10 2016
,
Aug 10 2016
eyaich: The console logs you quoted """ 2016-08-10 05:03:16.235 Google Chrome[12693:100439] NSWindow warning: adding an unknown subview: <FullSizeContentView: 0x7fdf5a7c3640>. Break on NSLog to debug. 2016-08-10 05:03:16.235 Google Chrome[12693:100439] Call stack: ( "+callStackSymbols disabled for performance reasons" ) """ are not relevant. I'm not familiar with how Telemetry emits stack traces during a crash, nor the symbolication process. I would recommend building a custom version of Chrome locally that crashes shortly after startup, and then testing locally to see if you can get symbolicated crashes.
,
Sep 28 2016
Is that issue becoming irrelevant as per migration to swarming (issue 633253) ?
,
Sep 28 2016
Marc: that's a good point. Though it's unsure whether the Chrome build config of perf benchmark will be the same as ones we use for unittest.
,
Oct 3 2016
,
Jan 16
,
Jan 16
|
||||||
►
Sign in to add a comment |
||||||
Comment 1 by nedngu...@google.com
, Aug 10 2016Labels: -Pri-3 Pri-2
Owner: eyaich@chromium.org
Status: Assigned (was: Untriaged)