
Issue 706452

Starred by 2 users

Issue metadata

Status: Available
Owner: ----
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: Chrome
Pri: 3
Type: Bug




Mickey crash rate high

Project Member Reported by keta...@chromium.org, Mar 29 2017

Issue description

Cc: dtor@chromium.org
I did a quick skim of crashes marked as bad_magic (30% of the kernel crashes I saw for mickey):

80d8c2c4e0000000:

<6>[ 1804.466624] PM: suspend entry 2017-03-28 11:36:30.868680189 UTC
<6>[ 1804.466641] PM: Syncing filesystems ... done.
<7>[ 1804.489853] PM: Preparing system for mem sleep
<5>[ 1804.490923] Freezing user space processes ... (elapsed 0.001 seconds) done.
<5>[ 1804.492452] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
<7>[ 1804.493664] PM: Entering mem sleep
<6>[ 1804.494004] calling  rfkill1+ @ 3841, parent: hci0, cb: rfkill_suspend
<6>[ 1804.494016] call rfkill1+ returned 0 after 1 usecs
<6>[ 1804.494072] calling  input6+ @ 3841, parent: 0003:06CB:0009.0005, cb: input_dev_suspend
<6>[ 1804.494109] call input6+ returned 0 after 13 usecs
<6>[ 1804.494154] calling  input5+ @ 3841, parent: 0003:06CB:0009.0004, cb: input_dev_suspend
<6>[ 1804.494173] call input5+ returned 0 after 8 usecs
<6>[ 1804.494207] calling  1-1.3.4+ @ 3742, parent: 1-1.3, cb: usb_dev_suspend
<0>[ 1804.506092] BUG: spinlock bad magic on CPU#2, kworker/u8:0/3742
<0>[ 1804.506115]  lock: 0xeb996a1c, .magic: 00000000, .owner: <none>/-1, .owner_cpu: 0
<6>[ 1804.506122] calling  rfkill0+ @ 3841, parent: phy0, cb: rfkill_suspend
<6>[ 1804.506126] call rfkill0+ returned 0 after 0 usecs
<6>[ 1804.506154] calling  phy0+ @ 1383, parent: mmc2:0001:1, cb: wiphy_suspend [cfg80211]
<5>[ 1804.506170] CPU: 2 PID: 3742 Comm: kworker/u8:0 Not tainted 3.14.0 #1
<5>[ 1804.506187] Workqueue: events_unbound async_run_entry_fn
<5>[ 1804.506214] [<c020e550>] (unwind_backtrace) from [<c020a910>] (show_stack+0x20/0x24)
<5>[ 1804.506235] [<c020a910>] (show_stack) from [<c07f8534>] (dump_stack+0x7c/0xc0)
<5>[ 1804.506255] [<c07f8534>] (dump_stack) from [<c026cd98>] (spin_dump+0x88/0x9c)
<5>[ 1804.506269] [<c026cd98>] (spin_dump) from [<c026cde0>] (spin_bug+0x34/0x38)
<6>[ 1804.506275] calling  input4+ @ 3841, parent: 0003:04B3:3018.0003, cb: input_dev_suspend
<5>[ 1804.506297] [<c026cde0>] (spin_bug) from [<c026ce7c>] (do_raw_spin_lock+0x30/0x18c)
<5>[ 1804.506312] [<c026ce7c>] (do_raw_spin_lock) from [<c07fd1c8>] (_raw_spin_lock+0x30/0x34)
<6>[ 1804.506316] call input4+ returned 0 after 10 usecs
<6>[ 1804.506336] calling  input3+ @ 3841, parent: 0003:04B3:3018.0002, cb: input_dev_suspend
<5>[ 1804.506343] [<c07fd1c8>] (_raw_spin_lock) from [<c07fc058>] (__mutex_lock_slowpath+0x138/0x25c)
<5>[ 1804.506351] [<c07fc058>] (__mutex_lock_slowpath) from [<c07fc1bc>] (mutex_lock+0x40/0x58)
<5>[ 1804.506359] [<c07fc1bc>] (mutex_lock) from [<c06842b0>] (rmi_set_sleep_mode+0x54/0x100)
<5>[ 1804.506367] [<c06842b0>] (rmi_set_sleep_mode) from [<c0684420>] (rmi_suspend+0x74/0x8c)
<5>[ 1804.506374] [<c0684420>] (rmi_suspend) from [<c0687490>] (hid_suspend+0x130/0x1e8)
<5>[ 1804.506381] [<c0687490>] (hid_suspend) from [<c05ae3b0>] (usb_suspend_both+0x90/0x1a8)
<5>[ 1804.506387] [<c05ae3b0>] (usb_suspend_both) from [<c05aee08>] (usb_suspend+0xf8/0xfc)
<5>[ 1804.506392] [<c05aee08>] (usb_suspend) from [<c05a0c88>] (usb_dev_suspend+0x1c/0x20)
<5>[ 1804.506401] [<c05a0c88>] (usb_dev_suspend) from [<c055612c>] (dpm_run_callback+0x48/0x84)
<5>[ 1804.506407] [<c055612c>] (dpm_run_callback) from [<c05571bc>] (__device_suspend+0x21c/0x2e0)
<5>[ 1804.506414] [<c05571bc>] (__device_suspend) from [<c05572a8>] (async_suspend+0x28/0xa0)
<5>[ 1804.506420] [<c05572a8>] (async_suspend) from [<c024c9b4>] (async_run_entry_fn+0x8c/0x188)
<5>[ 1804.506429] [<c024c9b4>] (async_run_entry_fn) from [<c023f478>] (process_one_work+0x2cc/0x460)
<5>[ 1804.506436] [<c023f478>] (process_one_work) from [<c023fb00>] (worker_thread+0x234/0x38c)
<5>[ 1804.506442] [<c023fb00>] (worker_thread) from [<c02457d0>] (kthread+0xe4/0xfc)
<5>[ 1804.506448] [<c02457d0>] (kthread) from [<c02064f8>] (ret_from_fork+0x14/0x20)
<1>[ 1804.506456] Unable to handle kernel NULL pointer dereference at virtual address 00000000

---

5c77621640000000: (same client ID)

<6>[ 1804.377864] PM: suspend entry 2017-03-28 10:36:18.873725730 UTC
<6>[ 1804.377882] PM: Syncing filesystems ... done.
<7>[ 1804.400055] PM: Preparing system for mem sleep
<5>[ 1804.401183] Freezing user space processes ... (elapsed 0.061 seconds) done.
<5>[ 1804.462441] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
<7>[ 1804.463675] PM: Entering mem sleep
<6>[ 1804.464042] calling  rfkill1+ @ 2836, parent: hci0, cb: rfkill_suspend
<6>[ 1804.464055] call rfkill1+ returned 0 after 1 usecs
<6>[ 1804.464127] calling  input6+ @ 2836, parent: 0003:06CB:0009.0005, cb: input_dev_suspend
<6>[ 1804.464158] call input6+ returned 0 after 15 usecs
<6>[ 1804.464186] calling  input5+ @ 2836, parent: 0003:06CB:0009.0004, cb: input_dev_suspend
<6>[ 1804.464208] call input5+ returned 0 after 9 usecs
<6>[ 1804.464232] calling  rfkill0+ @ 2836, parent: phy0, cb: rfkill_suspend
<6>[ 1804.464246] calling  1-1.3.4+ @ 2837, parent: 1-1.3, cb: usb_dev_suspend
<6>[ 1804.464257] call rfkill0+ returned 0 after 0 usecs
<6>[ 1804.464282] calling  input4+ @ 2836, parent: 0003:04B3:3018.0003, cb: input_dev_suspend
<6>[ 1804.464310] calling  phy0+ @ 85, parent: mmc2:0001:1, cb: wiphy_suspend [cfg80211]
<6>[ 1804.464340] call input4+ returned 0 after 11 usecs
<6>[ 1804.464369] calling  input3+ @ 2836, parent: 0003:04B3:3018.0002, cb: input_dev_suspend
<6>[ 1804.464395] call input3+ returned 0 after 13 usecs
<6>[ 1804.464420] calling  regulatory.0+ @ 2836, parent: platform, cb: platform_pm_suspend
<6>[ 1804.464432] call regulatory.0+ returned 0 after 0 usecs
<6>[ 1804.464482] calling  input2+ @ 2836, parent: 0003:0461:4D22.0001, cb: input_dev_suspend
<6>[ 1804.464504] call input2+ returned 0 after 9 usecs
<6>[ 1804.464527] calling  mmc2:0001:3+ @ 2836, parent: mmc2:0001, cb: pm_no_operation
<6>[ 1804.464535] calling  1-1.3.3+ @ 2839, parent: 1-1.3, cb: usb_dev_suspend
<6>[ 1804.464548] call mmc2:0001:3+ returned 0 after 0 usecs
<6>[ 1804.464562] calling  mmc2:0001:2+ @ 2836, parent: mmc2:0001, cb: pm_no_operation
<6>[ 1804.464573] call mmc2:0001:2+ returned 0 after 0 usecs
<6>[ 1804.464677] calling  1-1.4+ @ 2840, parent: 1-1, cb: usb_dev_suspend
<6>[ 1804.464886] calling  1-1.2+ @ 2842, parent: 1-1, cb: usb_dev_suspend
<6>[ 1804.468910] call 1-1.4+ returned 0 after 4118 usecs
<6>[ 1804.472582] cfg80211: Calling CRDA to update world regulatory domain
<0>[ 1804.476232] BUG: spinlock bad magic on CPU#1, kworker/u8:0/2837
<6>[ 1804.476236] call 1-1.2+ returned 0 after 11070 usecs
<6>[ 1804.476247] call 1-1.3.3+ returned 0 after 11432 usecs
<0>[ 1804.476276]  lock: 0xeb921a1c, .magic: 00000000, .owner: <none>/-1, .owner_cpu: 0
<5>[ 1804.476290] CPU: 1 PID: 2837 Comm: kworker/u8:0 Not tainted 3.14.0 #1
<5>[ 1804.476308] Workqueue: events_unbound async_run_entry_fn
<5>[ 1804.476334] [<c020e550>] (unwind_backtrace) from [<c020a910>] (show_stack+0x20/0x24)
<5>[ 1804.476354] [<c020a910>] (show_stack) from [<c07f8534>] (dump_stack+0x7c/0xc0)
<5>[ 1804.476372] [<c07f8534>] (dump_stack) from [<c026cd98>] (spin_dump+0x88/0x9c)
<5>[ 1804.476387] [<c026cd98>] (spin_dump) from [<c026cde0>] (spin_bug+0x34/0x38)
<5>[ 1804.476401] [<c026cde0>] (spin_bug) from [<c026ce7c>] (do_raw_spin_lock+0x30/0x18c)
<5>[ 1804.476416] [<c026ce7c>] (do_raw_spin_lock) from [<c07fd1c8>] (_raw_spin_lock+0x30/0x34)
<5>[ 1804.476434] [<c07fd1c8>] (_raw_spin_lock) from [<c07fc058>] (__mutex_lock_slowpath+0x138/0x25c)
<5>[ 1804.476450] [<c07fc058>] (__mutex_lock_slowpath) from [<c07fc1bc>] (mutex_lock+0x40/0x58)
<5>[ 1804.476469] [<c07fc1bc>] (mutex_lock) from [<c06842b0>] (rmi_set_sleep_mode+0x54/0x100)
<5>[ 1804.476486] [<c06842b0>] (rmi_set_sleep_mode) from [<c0684420>] (rmi_suspend+0x74/0x8c)
<5>[ 1804.476501] [<c0684420>] (rmi_suspend) from [<c0687490>] (hid_suspend+0x130/0x1e8)
<5>[ 1804.476516] [<c0687490>] (hid_suspend) from [<c05ae3b0>] (usb_suspend_both+0x90/0x1a8)
<5>[ 1804.476531] [<c05ae3b0>] (usb_suspend_both) from [<c05aee08>] (usb_suspend+0xf8/0xfc)
<5>[ 1804.476546] [<c05aee08>] (usb_suspend) from [<c05a0c88>] (usb_dev_suspend+0x1c/0x20)
<5>[ 1804.476563] [<c05a0c88>] (usb_dev_suspend) from [<c055612c>] (dpm_run_callback+0x48/0x84)
<5>[ 1804.476579] [<c055612c>] (dpm_run_callback) from [<c05571bc>] (__device_suspend+0x21c/0x2e0)
<5>[ 1804.476595] [<c05571bc>] (__device_suspend) from [<c05572a8>] (async_suspend+0x28/0xa0)
<5>[ 1804.476611] [<c05572a8>] (async_suspend) from [<c024c9b4>] (async_run_entry_fn+0x8c/0x188)
<5>[ 1804.476630] [<c024c9b4>] (async_run_entry_fn) from [<c023f478>] (process_one_work+0x2cc/0x460)
<5>[ 1804.476646] [<c023f478>] (process_one_work) from [<c023fb00>] (worker_thread+0x234/0x38c)
<5>[ 1804.476660] [<c023fb00>] (worker_thread) from [<c02457d0>] (kthread+0xe4/0xfc)
<5>[ 1804.476675] [<c02457d0>] (kthread) from [<c02064f8>] (ret_from_fork+0x14/0x20)
<1>[ 1804.476691] Unable to handle kernel NULL pointer dereference at virtual address 00000000

---

The next one is from the same client and looks the same. So it looks like one client is suspending every 1800 seconds and hitting the same crash over and over again.
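As a quick cross-check of the timing, the two "PM: suspend entry" lines above can be compared directly. This is only a sketch: the helper name `parse_suspend_entry` is made up for illustration, and the two timestamps are copied from the logs in this report. Both reports show suspend starting at about 1804 seconds of uptime; the wall-clock stamps turn out to be about an hour apart, and the logs alone do not show how that hour was spent.

```python
from datetime import datetime


def parse_suspend_entry(stamp: str) -> datetime:
    """Parse 'YYYY-MM-DD HH:MM:SS.nnnnnnnnn' (nanosecond precision)."""
    date_part, frac = stamp.split(".")
    # datetime keeps microseconds only, so keep the first six fractional digits.
    return datetime.strptime(f"{date_part}.{frac[:6]}", "%Y-%m-%d %H:%M:%S.%f")


# Timestamps copied from the two "PM: suspend entry" lines in this report.
first = parse_suspend_entry("2017-03-28 10:36:18.873725730")
second = parse_suspend_entry("2017-03-28 11:36:30.868680189")

gap_seconds = (second - first).total_seconds()
print(f"wall-clock gap between suspend entries: {gap_seconds:.1f} s")
```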

Actually, that one client appears to be about 20% of all mickey kernel crashes and nearly all of the "bad magic" ones.
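The first trace above is hard to read because suspend callbacks running on other threads print `<6>` lines in the middle of the backtrace. A rough way to pull a signature out of such a log is sketched below. The names `bad_magic_callers`, `first_driver_frame` and `LOCKING_FRAMES` are invented for this example, the regexes assume the ARM-style `[<addr>] (callee) from [<addr>] (caller+off/size)` frame format seen in these logs, and the sample is a shortened copy of the first report. A zero `.magic` usually means the lock memory was never initialized or was cleared, but the log alone does not say which.

```python
import re

# "<level>[ seconds.micros] message"
LINE_RE = re.compile(r"^<(\d+)>\[\s*(\d+\.\d+)\] (.*)$")
# "[<addr>] (callee) from [<addr>] (caller+0xoff/0xsize)"
FRAME_RE = re.compile(
    r"^\[<[0-9a-f]+>\] \(([^\s)]+)(?: \[\w+\])?\) from \[<[0-9a-f]+>\] \(([^\s+]+)\+0x"
)

# Generic frames that say nothing about which driver owns the bad lock.
LOCKING_FRAMES = {
    "show_stack", "dump_stack", "spin_dump", "spin_bug",
    "do_raw_spin_lock", "_raw_spin_lock", "__mutex_lock_slowpath", "mutex_lock",
}


def bad_magic_callers(log_text):
    """Return caller names from the backtrace that follows a bad-magic BUG line."""
    callers = []
    in_trace = False
    for raw in log_text.splitlines():
        m = LINE_RE.match(raw.strip())
        if not m:
            continue
        msg = m.group(3)
        if "BUG: spinlock bad magic" in msg:
            in_trace = True
            continue
        if not in_trace:
            continue
        frame = FRAME_RE.match(msg)
        if frame:  # interleaved non-frame lines are skipped
            callers.append(frame.group(2))
            if frame.group(2) == "ret_from_fork":
                break
    return callers


def first_driver_frame(log_text):
    """Return the first caller outside the generic locking/dump frames, or None."""
    for name in bad_magic_callers(log_text):
        if name not in LOCKING_FRAMES:
            return name
    return None


SAMPLE = """\
<6>[ 1804.494207] calling  1-1.3.4+ @ 3742, parent: 1-1.3, cb: usb_dev_suspend
<0>[ 1804.506092] BUG: spinlock bad magic on CPU#2, kworker/u8:0/3742
<5>[ 1804.506269] [<c026cd98>] (spin_dump) from [<c026cde0>] (spin_bug+0x34/0x38)
<6>[ 1804.506275] calling  input4+ @ 3841, parent: 0003:04B3:3018.0003, cb: input_dev_suspend
<5>[ 1804.506297] [<c026cde0>] (spin_bug) from [<c026ce7c>] (do_raw_spin_lock+0x30/0x18c)
<5>[ 1804.506359] [<c07fc1bc>] (mutex_lock) from [<c06842b0>] (rmi_set_sleep_mode+0x54/0x100)
<5>[ 1804.506367] [<c06842b0>] (rmi_set_sleep_mode) from [<c0684420>] (rmi_suspend+0x74/0x8c)
"""

print(first_driver_frame(SAMPLE))
```

On this sample the first non-locking frame is `rmi_set_sleep_mode`, which matches what the full traces show: the mutex taken in the RMI suspend path is the one with the bad lock.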


Skimming hung tasks:

---

80552dc4e0000000:

Broadcom WiFi and/or dwmmc:

<6>[439219.486470] tpm_i2c_infineon 1-0020: command 0x65 (size 22) returned code 0x0
<3>[439517.994283] mmc_host mmc2: Timeout sending command (cmd 0x200000 arg 0x0 status 0x80200000)
<3>[439518.504167] dwmmc_rockchip ff0d0000.dwmmc: Busy; trying anyway
<3>[439525.984575] brcmfmac: brcmf_proto_bcdc_query_dcmd: brcmf_proto_bcdc_msg failed w/status -5
...
...
<3>[439665.034713] brcmfmac: brcmf_cfg80211_scan: scan error (-5)
<3>[439680.174528] INFO: task ksdioirqd/mmc2:395 blocked for more than 120 seconds.
<3>[439680.174646]       Not tainted 3.14.0 #1

---

051a8544e0000000:

Broadcom bluetooth vs. suspend/resume (I spent time on this type of thing ages ago, but didn't get to the root cause of everything):

<7>[110911.241414] PM: Preparing system for mem sleep
<3>[110919.025785] Bluetooth: hci0 link tx timeout
<3>[110919.025801] Bluetooth: hci0 killing stalled connection 00:00:00:00:00:02
<3>[110919.026191] Bluetooth: hci0 link tx timeout
<3>[110919.026201] Bluetooth: hci0 killing stalled connection 00:00:00:00:00:02
<12>[111112.295756] init: trim main process (1629) terminated with status 1
<3>[111120.174188] INFO: task kworker/u9:0:100 blocked for more than 120 seconds.
<3>[111120.174200]       Tainted: G        W    3.14.0 #1
<3>[111120.174207] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
<6>[111120.174215] kworker/u9:0    D c06db63c     0   100      2 0x00000000
<6>[111120.174241] Workqueue: hci0 hci_rx_work [bluetooth]
<5>[111120.174259] [<c06db63c>] (__schedule) from [<c06db998>] (schedule+0xa4/0xa8)
<5>[111120.174272] [<c06db998>] (schedule) from [<c06dbd5c>] (schedule_preempt_disabled+0x30/0x40)
<5>[111120.174284] [<c06dbd5c>] (schedule_preempt_disabled) from [<c06ddc18>] (__mutex_lock_slowpath+0x1ac/0x25c)
<5>[111120.174296] [<c06ddc18>] (__mutex_lock_slowpath) from [<c06ddd08>] (mutex_lock+0x40/0x58)
<5>[111120.174310] [<c06ddd08>] (mutex_lock) from [<c04462a4>] (device_pm_remove+0x2c/0x68)
<5>[111120.174323] [<c04462a4>] (device_pm_remove) from [<c0439f64>] (device_del+0x120/0x17c)
<5>[111120.174342] [<c0439f64>] (device_del) from [<bf1421bc>] (hci_conn_del_sysfs+0x60/0x78 [bluetooth])
<5>[111120.174372] [<bf1421bc>] (hci_conn_del_sysfs [bluetooth]) from [<bf12a8dc>] (hci_chan_list_flush+0x12c/0x154 [bluetooth])
<5>[111120.174397] [<bf12a8dc>] (hci_chan_list_flush [bluetooth]) from [<bf12ab14>] (hci_conn_del+0x154/0x15c [bluetooth])
<5>[111120.174421] [<bf12ab14>] (hci_conn_del [bluetooth]) from [<bf12e160>] (read_enc_key_size_complete+0x4f8/0x5370 [bluetooth])
<5>[111120.174446] [<bf12e160>] (read_enc_key_size_complete [bluetooth]) from [<bf1332e0>] (hci_event_packet+0x308/0x1064 [bluetooth])
<5>[111120.174470] [<bf1332e0>] (hci_event_packet [bluetooth]) from [<bf124438>] (hci_rx_work+0xfc/0x230 [bluetooth])
<5>[111120.174490] [<bf124438>] (hci_rx_work [bluetooth]) from [<c013e074>] (process_one_work+0x244/0x438)
<5>[111120.174503] [<c013e074>] (process_one_work) from [<c013f2bc>] (worker_thread+0x234/0x38c)
<5>[111120.174514] [<c013f2bc>] (worker_thread) from [<c0144f9c>] (kthread+0xe4/0xfc)
<5>[111120.174526] [<c0144f9c>] (kthread) from [<c01064f8>] (ret_from_fork+0x14/0x20)

---

1330209360000000:

Some sort of Broadcom errors:

<3>[ 8903.944505] brcmfmac: brcmf_proto_bcdc_query_dcmd: brcmf_proto_bcdc_msg failed w/status -5
<3>[ 8903.944694] brcmfmac: brcmf_cfg80211_get_station: GET STA INFO failed, -5
<3>[ 8905.944514] brcmfmac: brcmf_proto_bcdc_query_dcmd: brcmf_proto_bcdc_msg failed w/status -5
<3>[ 8905.944701] brcmfmac: brcmf_cfg80211_get_station: Could not get rssi (-5)
...
...
<3>[ 9120.164312] INFO: task kworker/u8:3:14071 blocked for more than 120 seconds.
<3>[ 9120.164331]       Not tainted 3.14.0 #1
<3>[ 9120.164340] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
<6>[ 9120.164350] kworker/u8:3    D c0680ce4     0 14071      2 0x00000000
<6>[ 9120.164381] Workqueue: brcmf_wq brcmf_sdio_dataworker [brcmfmac]
<5>[ 9120.164404] [<c0680ce4>] (__schedule) from [<c068105c>] (schedule+0xa4/0xa8)
<5>[ 9120.164417] [<c068105c>] (schedule) from [<c06800b8>] (schedule_timeout+0x38/0x268)
<5>[ 9120.164430] [<c06800b8>] (schedule_timeout) from [<c0681cf8>] (wait_for_common+0x148/0x188)
<5>[ 9120.164443] [<c0681cf8>] (wait_for_common) from [<c0681d58>] (wait_for_completion+0x20/0x24)
<5>[ 9120.164459] [<c0681d58>] (wait_for_completion) from [<c0508064>] (mmc_wait_for_req+0x84/0x158)
<5>[ 9120.164479] [<c0508064>] (mmc_wait_for_req) from [<bf0f3c90>] (brcmf_ops_sdio_probe+0x6d8/0x900 [brcmfmac])
<5>[ 9120.164510] [<bf0f3c90>] (brcmf_ops_sdio_probe [brcmfmac]) from [<bf0f46c4>] (brcmf_sdiod_send_pkt+0xa4/0xcc [brcmfmac])
<5>[ 9120.164531] [<bf0f46c4>] (brcmf_sdiod_send_pkt [brcmfmac]) from [<bf0f2944>] (brcmf_sdio_dataworker+0x1a20/0x1c30 [brcmfmac])
<5>[ 9120.164553] [<bf0f2944>] (brcmf_sdio_dataworker [brcmfmac]) from [<c013def0>] (process_one_work+0x244/0x438)

---

8d4f7e1360000000:

More Broadcom fun:

<3>[2364942.574765] brcmfmac: brcmf_proto_bcdc_query_dcmd: brcmf_proto_bcdc_msg failed w/status -5
<3>[2364942.574900] brcmfmac: brcmf_cfg80211_get_station: GET STA INFO failed, -5
<3>[2364944.574572] brcmfmac: brcmf_proto_bcdc_query_dcmd: brcmf_proto_bcdc_msg failed w/status -5
...
...
<3>[2365060.614669] brcmfmac: brcmf_get_bss_info: Could not get BSS_INFO (-5)
<3>[2365060.614763] brcmfmac: brcmf_cfg80211_get_station: Could not get BSS_INFO (-5)
<3>[2365080.324358] INFO: task kworker/u8:2:8949 blocked for more than 120 seconds.
<3>[2365080.324374]       Not tainted 3.14.0 #1
<3>[2365080.324380] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
<6>[2365080.324390] kworker/u8:2    D c0684984     0  8949      2 0x00000000
<6>[2365080.324416] Workqueue: brcmf_wq brcmf_sdio_dataworker [brcmfmac]
<5>[2365080.324437] [<c0684984>] (__schedule) from [<c0684cfc>] (schedule+0xa4/0xa8)
<5>[2365080.324453] [<c0684cfc>] (schedule) from [<c0683d58>] (schedule_timeout+0x38/0x268)
<5>[2365080.324469] [<c0683d58>] (schedule_timeout) from [<c0685998>] (wait_for_common+0x148/0x188)
<5>[2365080.324485] [<c0685998>] (wait_for_common) from [<c06859f8>] (wait_for_completion+0x20/0x24)
<5>[2365080.324504] [<c06859f8>] (wait_for_completion) from [<c0509cb4>] (mmc_wait_for_req+0x84/0x158)
<5>[2365080.324528] [<c0509cb4>] (mmc_wait_for_req) from [<bf13fc94>] (brcmf_ops_sdio_probe+0x6d8/0x900 [brcmfmac])
<5>[2365080.324556] [<bf13fc94>] (brcmf_ops_sdio_probe [brcmfmac]) from [<bf1406c8>] (brcmf_sdiod_send_pkt+0xa4/0xcc [brcmfmac])
<5>[2365080.324576] [<bf1406c8>] (brcmf_sdiod_send_pkt [brcmfmac]) from [<bf13e948>] (brcmf_sdio_dataworker+0x1a20/0x1c30 [brcmfmac])
<5>[2365080.324598] [<bf13e948>] (brcmf_sdio_dataworker [brcmfmac]) from [<c013def0>] (process_one_work+0x244/0x438)

---

5377b104e0000000:

Totally unexpected, but crashed in Broadcom code.  

<3>[679766.512625] mmc_host mmc2: Timeout sending command (cmd 0x200000 arg 0x0 status 0x80200000)
<3>[679767.022603] dwmmc_rockchip ff0d0000.dwmmc: Busy; trying anyway
<3>[679774.872898] brcmfmac: brcmf_proto_bcdc_query_dcmd: brcmf_proto_bcdc_msg failed w/status -5
...
...
<3>[679920.182618] INFO: task ksdioirqd/mmc2:416 blocked for more than 120 seconds.
<3>[679920.182636]       Not tainted 3.14.0 #1
<3>[679920.182645] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
<6>[679920.182656] ksdioirqd/mmc2  D c06db63c     0   416      2 0x00000000
<5>[679920.182688] [<c06db63c>] (__schedule) from [<c06db998>] (schedule+0xa4/0xa8)
<5>[679920.182707] [<c06db998>] (schedule) from [<c0549584>] (__mmc_claim_host+0xd8/0x1c0)
<5>[679920.182726] [<c0549584>] (__mmc_claim_host) from [<c05561b8>] (sdio_irq_thread+0x74/0x2e4)
<5>[679920.182742] [<c05561b8>] (sdio_irq_thread) from [<c0144f9c>] (kthread+0xe4/0xfc)
<5>[679920.182760] [<c0144f9c>] (kthread) from [<c01064f8>] (ret_from_fork+0x14/0x20)

---

4bb9e7f8e0000000:

Broadcom not immediately implicated (!).  Crash looks like this:

<6>[11885.589431] PM: suspend entry 2017-03-28 16:28:27.120858129 UTC
<6>[11885.589457] PM: Syncing filesystems ... done.
<7>[11885.610989] PM: Preparing system for mem sleep
<3>[12120.165677] INFO: task kworker/u8:0:5957 blocked for more than 120 seconds.
<3>[12120.165700]       Tainted: G        W    3.14.0 #1
<3>[12120.165708] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
<6>[12120.165719] kworker/u8:0    D c06db63c     0  5957      2 0x00000000
<6>[12120.165748] Workqueue: events_unbound async_run_entry_fn
<5>[12120.165774] [<c06db63c>] (__schedule) from [<c06db998>] (schedule+0xa4/0xa8)
<5>[12120.165788] [<c06db998>] (schedule) from [<c06dbd5c>] (schedule_preempt_disabled+0x30/0x40)
<5>[12120.165808] [<c06dbd5c>] (schedule_preempt_disabled) from [<c06ddc18>] (__mutex_lock_slowpath+0x1ac/0x25c)
<5>[12120.165823] [<c06ddc18>] (__mutex_lock_slowpath) from [<c06ddd08>] (mutex_lock+0x40/0x58)
<5>[12120.165840] [<c06ddd08>] (mutex_lock) from [<c044f998>] (_request_firmware+0x424/0x648)
<5>[12120.165856] [<c044f998>] (_request_firmware) from [<c044fbf8>] (request_firmware+0x3c/0x50)
<5>[12120.165870] [<c044fbf8>] (request_firmware) from [<c044fc44>] (__async_dev_cache_fw_image+0x38/0xa0)
<5>[12120.165884] [<c044fc44>] (__async_dev_cache_fw_image) from [<c014c17c>] (async_run_entry_fn+0x8c/0x188)
<5>[12120.165906] [<c014c17c>] (async_run_entry_fn) from [<c013e074>] (process_one_work+0x244/0x438)
<5>[12120.165924] [<c013e074>] (process_one_work) from [<c013f2bc>] (worker_thread+0x234/0x38c)
<5>[12120.165938] [<c013f2bc>] (worker_thread) from [<c0144f9c>] (kthread+0xe4/0xfc)
<5>[12120.165955] [<c0144f9c>] (kthread) from [<c01064f8>] (ret_from_fork+0x14/0x20)

...but you'll be relieved to know that this sounds a lot like things I dug into in b/35555692 that seemed related to Broadcom Bluetooth.  :-P
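For skimming more of these, the hung-task header lines can be bucketed by task name. The sketch below is an assumption-laden helper, not part of any crash tooling: `hung_tasks` and `HUNG_RE` are names invented here, and the sample lines are copied from the reports above. The task name itself can contain a colon (for example `kworker/u9:0`), so the pattern relies on the PID being the last colon-separated field before " blocked".

```python
import re
from collections import Counter

# Greedy (.+) so that names such as "kworker/u9:0" keep their own colon.
HUNG_RE = re.compile(r"INFO: task (.+):(\d+) blocked for more than (\d+) seconds\.")


def hung_tasks(log_text):
    """Yield (task_name, pid, timeout_seconds) for each hung-task report line."""
    for line in log_text.splitlines():
        m = HUNG_RE.search(line)
        if m:
            yield m.group(1), int(m.group(2)), int(m.group(3))


SAMPLE = """\
<3>[439680.174528] INFO: task ksdioirqd/mmc2:395 blocked for more than 120 seconds.
<3>[111120.174188] INFO: task kworker/u9:0:100 blocked for more than 120 seconds.
<3>[ 9120.164312] INFO: task kworker/u8:3:14071 blocked for more than 120 seconds.
<3>[679920.182618] INFO: task ksdioirqd/mmc2:416 blocked for more than 120 seconds.
"""

reports = list(hung_tasks(SAMPLE))

# Bucket kworker threads together, since their numeric suffix is not stable.
by_kind = Counter(
    "kworker" if name.startswith("kworker/") else name for name, _, _ in reports
)
print(by_kind)
```

A kworker bucket is only a starting point; the `Workqueue:` line in each report (hci0, brcmf_wq, events_unbound) is what actually separates the Bluetooth, WiFi and firmware-loading cases.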

Summary: Mickey crash rate high (was: Mickey renderer crashes high at 427 Browser CPM)
Browser rate is high at 427 CPM (2-3x normal). Kernel is high at 138 CPM (10x normal).

Other numbers look fine.
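For reference, the "normal" rates implied by the multiples quoted above can be worked out as follows. This is plain arithmetic on the numbers in this comment; `implied_baseline` is a name made up for the example, and the baselines are inferred here, not taken from any dashboard.

```python
def implied_baseline(observed_cpm: float, multiple: float) -> float:
    """Baseline rate implied by 'observed is <multiple>x normal'."""
    return observed_cpm / multiple


kernel_normal = implied_baseline(138, 10)        # kernel: 10x normal
browser_normal_low = implied_baseline(427, 3)    # browser: 3x normal
browser_normal_high = implied_baseline(427, 2)   # browser: 2x normal

print(f"kernel baseline: about {kernel_normal:.1f} CPM")
print(f"browser baseline: about {browser_normal_low:.0f} to {browser_normal_high:.0f} CPM")
```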
Forked comment #1 (USB HID crash) to bug #706899
Project Member

Comment 5 by sheriffbot@chromium.org, Apr 2 2018

Labels: Hotlist-Recharge-Cold
Status: Untriaged (was: Available)
This issue has been Available for over a year. If it's no longer important or seems unlikely to be fixed, please consider closing it out. If it is important, please re-triage the issue.

Sorry for the inconvenience if the bug really should have been left as Available.

For more details visit https://www.chromium.org/issue-tracking/autotriage - Your friendly Sheriffbot
Cc: briannorris@chromium.org diand...@chromium.org cernekee@chromium.org
Owner: ----
Status: Available (was: Untriaged)
Moving to available.  No idea how important this is, but it's not something I'm spending time on at the moment.  Adding a few WiFi folks in case they are interested.
Labels: Kernel-3.14 OS-Chrome
Components: OS>Kernel OS>Systems>Network
Labels: Enterprise-Triaged
