Collect feedback reports which contain OOM kill events and analyze their cause |
||
Issue descriptionVovo has collected some crash logs and analyzed OOM causes from it. To get more OOM logs, I'm turning to feedback reports to get more data. Currently there lacks a way to bulk download feedback reports and filter those we're interested in. Feedback team has a stubby server which we can list/query feedback reports and download system_logs.txt. However the service is not for bulk download, so it would be fairly slow to examine a large amount (say, 1 month) of feedback reports. I've contacted feedback support team and we're seeking for ways to attach OOM labels to feedback reports so we can search by labels. In the meanwhile, I'm writing a script to download feedback in parallel (e.g., multiple worker or a map-reduce program). After that I'll analyze those collected report to understand why OOM happens.
,
May 29 2018
do we have any ideas on what caused the sudden drop?
,
May 29 2018
FYI, the downloader code has been submitted to my google3 experimental directory: https://critique.corp.google.com/#review/198438683 Sample usage: 1. checkout the source 2. google3$ blaze build experimental/users/cylee/fb_downloader 3. google3$ blaze-bin/experimental/users/cylee/fb_downloader/fb_downloader --date 20180516 --download_dir ./non_automatic --min_chrome_version 65 or you can run fb_downloader --help to see all options.
,
May 30 2018
Here's some statistics by analyzing OOM log from 20180516: anon < 100MB and anon < swap_free: 36 (20.81%) swap_free < 100MB and swap_free < anon: 76 (43.93%) others: 61 (35.26%) Total: 173 Details are in the attached file. vovo has plans to solve OOM caused by the first case (anon ~= 0) and the second case (swap free ~= 0). However there're still other cases we need to investigate.
,
Sep 4
|
||
►
Sign in to add a comment |
||
Comment 1 by cylee@chromium.org
, May 29 2018