New issue
Advanced search Search tips

Issue 648633 link

Starred by 3 users

Issue metadata

Status: Fixed
Owner:
Closed: Nov 2016
Cc:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 1
Type: Bug



Sign in to add a comment

Some perf bots cannot send data to dashboard

Project Member Reported by rnep...@chromium.org, Sep 20 2016

Issue description

Bot:
https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5X%20Perf%20%281%29

Stdio:
https://build.chromium.org/p/chromium.perf/builders/Android%20Nexus5X%20Perf%20%281%29/builds/3172/steps/v8.top_25_smooth/logs/stdio

Error message:
@@@STEP_LINK@Results Dashboard@https://chromeperf.appspot.com/report?masters=ChromiumPerf&bots=android-nexus5X&tests=v8.top_25_smooth&rev=419463@@@
HTTPError: 500. Reponse: 
<html><head>
<meta http-equiv="content-type" content="text/html;charset=utf-8">
<title>500 Server Error</title>
</head>
<body text=#000000 bgcolor=#ffffff>
<h1>Error: Server Error</h1>
<h2>The server encountered an error and could not complete your request.<p>Please try again in 30 seconds.</h2>
<h2></h2>
</body></html>
 
Looking into this. Initial investigation:

* Taking the JSON and POST-ing it in a unit test works correctly, so it's not the JSON.
* We have a lot of requests to the dashboard running out of memory. Those tasks don't print the chartjson. maybe it is too big.
* Trying a new version that prints a condensed version of chartjson.
Summary: Some perf bots cannot send data to dashboard (was: Nexus 5X perf bot cannot send data to dashboard)
Changing name to reflect reality. 
The dashboard doesn't get to the point where it prints out the condensed chartjson before OOM-ing. Rolling back to a version from before the changes I made yesterday, since this started yesterday morning.
Owner: dtu@chromium.org
Dave is working on seeing if he can repro the errors using the JSON from the buildbots.

Comment 6 by estaab@chromium.org, Sep 21 2016

Cc: estaab@chromium.org
It seems to be a memory leak. We've rolled back to a version of dashboard from August 30 and it works correctly, will try to narrow down the problematic CL tomorrow.

Comment 8 by dtu@chromium.org, Sep 22 2016

Working theory:

clean-aiolos-dccd is at instance class F4_1G; the soft memory limit is 1024 MB.

I imply that the app.yaml specified this instance class at some point in the past, and app engine memorized it even after the instance class was deleted from the app.yaml file. So it's been using F4_1G for a long time.

Uploading the bisect module on sept 14, which did specify an instance class of B1, caused app engine to update the default module's instance class to the default value, which is F1 (128 MB limit)
Issue 649446 has been merged into this issue.
Ping
Ping. Did this get fixed?

Comment 12 by zh...@chromium.org, Nov 15 2016

Has this been fixed? The bots are green.
Status: Fixed (was: Untriaged)
Sorry about this! It was fixed.

Sign in to add a comment