New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 839470 link

Starred by 2 users

Issue metadata

Status: Assigned
Owner:
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 2
Type: ----

Blocking:
issue 838940



Sign in to add a comment

v8.browsing_desktop-future/browse:social:twitter_infinite_scroll in v8.browsing_desktop-future failing flakily on chromium.perf/Win 10 Perf

Project Member Reported by sheriff-...@appspot.gserviceaccount.com, May 3 2018

Issue description

Filed by sheriff-o-matic@appspot.gserviceaccount.com on behalf of sullivan@google.com

May be related to  bug 839411  although we see that on slightly different platforms.

The flakiness dashboard shows a pretty clear start to the flakiness around r552546:r552590. I'll kick off a bisect.

Builders failed on: 
- Win 10 Perf: 
  https://ci.chromium.org/buildbot/chromium.perf/Win%2010%20Perf


 
Cc: u...@chromium.org mythria@chromium.org
Cc-ing benchmark owners. Will disable this story on Windows 10.
Project Member

Comment 3 by bugdroid1@chromium.org, May 3 2018

The following revision refers to this bug:
  https://chromium.googlesource.com/chromium/src.git/+/4f5bfb44ed260127602cfc4994865a80fd5d0eb4

commit 4f5bfb44ed260127602cfc4994865a80fd5d0eb4
Author: Annie Sullivan <sullivan@chromium.org>
Date: Thu May 03 17:25:03 2018

Disable v8.browsing_desktop-future/browse:social:twitter_infinite_scroll on Win.

It is failing flakily.

NOTRY=true
TBR=nednguyen@google.com

Bug: 839470
Change-Id: Ia5657133812ffa541dca0ff0c72db819ee7163ea
Reviewed-on: https://chromium-review.googlesource.com/1042271
Reviewed-by: Annie Sullivan <sullivan@chromium.org>
Commit-Queue: Annie Sullivan <sullivan@chromium.org>
Cr-Commit-Position: refs/heads/master@{#555783}
[modify] https://crrev.com/4f5bfb44ed260127602cfc4994865a80fd5d0eb4/tools/perf/expectations.config

This is also failing on Win7 Nvidia, and on v8.browsing_desktop (non-future version)
Project Member

Comment 5 by bugdroid1@chromium.org, May 3 2018

The following revision refers to this bug:
  https://chromium.googlesource.com/chromium/src.git/+/a5f2f51e43e86c7863595be6e0198b43f550b354

commit a5f2f51e43e86c7863595be6e0198b43f550b354
Author: Annie Sullivan <sullivan@chromium.org>
Date: Thu May 03 17:44:57 2018

Disable v8.browsing_desktop/browse:social:twitter_infinite_scroll on Win.

TBR=nednguyen@google.com
NOTRY=true

Bug: 839470
Change-Id: I7010d53bfdc84b9ac540735df8be060c9e5c2735
Reviewed-on: https://chromium-review.googlesource.com/1042279
Reviewed-by: Annie Sullivan <sullivan@chromium.org>
Commit-Queue: Annie Sullivan <sullivan@chromium.org>
Cr-Commit-Position: refs/heads/master@{#555798}
[modify] https://crrev.com/a5f2f51e43e86c7863595be6e0198b43f550b354/tools/perf/expectations.config

Cc: kbr@chromium.org rdevlin....@chromium.org alex...@chromium.org lukasza@chromium.org isherman@chromium.org pastarmovj@chromium.org
Owner: lukasza@chromium.org
Status: Assigned (was: Available)
📍 Found a significant difference after 1 commit.
https://pinpoint-dot-chromeperf.appspot.com/job/14f5030bc40000

Make --site-per-process the default on ToT via fieldtrial_testing_config by lukasza@chromium.org
https://chromium.googlesource.com/chromium/src/+/fb1ccf02ee8ca79e1404abfd3a3a7d540b7d2dbd

Understanding performance regressions:
  http://g.co/ChromePerformanceRegressions
Cc: dtu@chromium.org
Owner: dtu@chromium.org
dtu: I wanted to pull the logs on the failures to see why the success rate seems to go from 90% to 60% at that CL, but it actually looks like all the failures are swarming task failures and not failures caused by the CL? Can you take a look?
Blocking: 838940
Components: Internals>Sandbox>SiteIsolation
Cc: -isherman@chromium.org

Comment 10 by kbr@chromium.org, May 3 2018

Cc: -kbr@chromium.org

Comment 11 by dtu@chromium.org, May 4 2018

In this case, the Swarming task failures are test failures. So yes, the failures are caused by this CL.
Owner: lukasza@chromium.org
Thanks, Dave! Assigning back to lukasza, here's more information on what's going on:

After your CL, the test got flaky on Windows 10. You can see that on flakiness dashboard: https://test-results.appspot.com/dashboards/flakiness_dashboard.html#testType=v8.browsing_desktop-future&builder=chromium.perf%3AWin%2010%20Perf

So we used pinpoint to bisect, and it narrows down flakiness to your CL (the chart is the success rate of the test): https://pinpoint-dot-chromeperf.appspot.com/job/14f5030bc40000

A very similar thing is happening on  bug 839411 , but the bisect hasn't completed yet.

This might be tricky to debug, because it doesn't seem to happen on all hardware; I think all the machines it's reproducing on are Dell PowerEdge r220s. If you need to reproduce and it's not failing locally we can try to pull one of the machines out of our swarming pool to rdesktop into.

The command line for the test is:
tools/perf/run_benchmark v8.browsing_desktop-future --story-filter browse.social.twitter.infinite.scroll

This is a performance integration test, it tries to open the browser, load a recorded version of twitter, scroll, and collect performance metrics. 

I posted a screenshot showing how to access the logs from pinpoint, but they're not super helpful. They say the test timed out, but this can happen when the browser crashes.

See go/reproduce-telemetry for more documentation.
pinpoint.png
176 KB View Download
Thanks Annie for the detailed steps, I just had a quick look at the logs. It is timing out waiting for a javascript condition when scrolling[1] in all the traces. So, it could be also that some resource is not being loaded with site-isolation? That still doesn't rule out browser crash though.

[1]https://cs.chromium.org/chromium/src/tools/perf/page_sets/system_health/browsing_stories.py?q=system_health/b&sq=package:chromium&l=855
RE: it could be also that some resource is not being loaded with site-isolation

Is it possible to check DevTools console for messages similar to:
Blocked current origin from receiving cross-site document at <URL> with MIME type <text/html, text/json, etc.>.

Alternatively - is it possible to look at the UMA data for SiteIsolation.XSD.Browser.Action and/or SiteIsolation.XSD.Browser.Blocked?

If the message or the UMA are present, it might indicate that Cross-Origin Read Blocking [1] blocks one of the responses.

[1] https://chromium.googlesource.com/chromium/src/+/master/services/network/cross_origin_read_blocking_explainer.md
Cc: nednguyen@chromium.org charliea@chromium.org
 Issue 839411  has been merged into this issue.
Also - is it possible to check for browser crashes (and possibly correlate them to specific Crash reports)?  (#c12 suggests that a browser crash might be one explanation for the observed timeouts of the benchmark)
I looked at the devtools console output (on linux, though the bug is related to windows. I thought the platform shouldn't matter much for checking this). There are no messages related to cross-site document being blocked. This bug is for an internal benchmark. These benchmarks use a recorded web page. I am not sure UMA data would help here?

We get browser crash logs in the pinpoint jobs (#c12 explains how to look for them), but in this case there is nothing in the log. I am not sure if there are other ways of looking for crash logs.
Hmmm... is it possible that Site Isolation affects the timing of various things (e.g. by allowing cross-site frames to load in parallel in their separate renderers) and this confuses the "driver" of the test (e.g. so that it simulates clicks, reloads or other interactions at not quite the right time)?  This theory seems compatible with the observed flakiness (e.g. sometimes even synchronizing via sleep statements will luckily get the right timing).  OTOH, I don't know how one could verify this theory... :-(


1/4 failed test tasks I looked at in https://pinpoint-dot-chromeperf.appspot.com/job/14f5030bc40000 contains a message about a minidump (https://chromium-swarm.appspot.com/task?id=3d40ab456f1ffb10&refresh=10&show_raw=1).  Looking at the minidump points at content::protocol::SystemInfoHandlerGpuObserver::ObserverWatchdogCallback (so possible a dupe of  issue 842782 , similarily to what we've together found in  https://crbug.com/842180#c10  -  https://crbug.com/842180#c17 ).

I am not sure if this is good news (we have found the root cause) or bad news (3/4 of the failed test runs didn't say anything about a minidump, so maybe there are more unknown failure causes hiding here)...  FWIW, the 3 failed tasks I looked at that didn't contain minidumps are the ones below:
- https://chromium-swarm.appspot.com/task?id=3d40ab41a73c0910&refresh=10&show_raw=1
- https://chromium-swarm.appspot.com/task?id=3d40ab42f00f6d10&refresh=10&show_raw=1
- https://chromium-swarm.appspot.com/task?id=3d40ab448f081510&refresh=10&show_raw=1

Sign in to add a comment