"content_unittests (with patch)" is flaky
Issue description
"content_unittests (with patch)" is flaky. This issue was created automatically by the chromium-try-flakes app. Please find the right owner to fix the respective test/step and assign this issue to them. If the step/test is infrastructure-related, please add Infra-Troopers label and change issue status to Untriaged. When done, please remove the issue from Sheriff Bug Queue by removing the Sheriff-Chromium label. We have detected 5 recent flakes. List of all flakes can be found at https://chromium-try-flakes.appspot.com/all_flake_occurrences?key=ahVzfmNocm9taXVtLXRyeS1mbGFrZXNyKQsSBUZsYWtlIh5jb250ZW50X3VuaXR0ZXN0cyAod2l0aCBwYXRjaCkM. This flaky test/step was previously tracked in issue 625346.
,
Oct 13 2016
,
Oct 14 2016
Reactivating this issue as there have been a few more flakes in the last 12 or so hours. Since it still appears to be a timeout issue, adding the infra-troopers tag.
,
Oct 14 2016
I'm not seeing examples where it's a timeout? Could you add a link?
,
Oct 14 2016
It's definitely possible that I'm not reading the logs correctly, but here is an example from https://uberchromegw.corp.google.com/i/chromium.linux/builders/Android%20Tests/builds/33362/steps/content_unittests%20on%20Android/logs/stdio
E 1285.004s run_tests_on_device(06adc6ac003b6d51) Shard timed out: run_tests_on_device(06adc6ac003b6d51)
I 1346.615s run_tests_on_device(0717f034003bca6a) Still working on StartInstrumentation(0717f034003bca6a, org.chromium.native_test/org.chromium.native_test.NativeTestInstrumentationTestRunner, retries=0, extras={'org.chromium.native_test.NativeTestInstrumentationTestRunner.ShardNanoTimeout': 120000000000, 'org.chromium.native_test.NativeTestInstrumentationTestRunner.NativeTestActivity': 'org.chromium.native_test.NativeUnitTestActivity', 'org.chromium.native_test.NativeTestInstrumentationTestRunner.TestList': '/data/local/tmp/temp_file-6028d20f38efe'}, timeout=120.0, raw=False)
C 1406.592s run_tests_on_device(0717f034003bca6a) Timed out. Dumping threads.
So far today I have seen this twice on the Android builder.
,
Oct 17 2016
This looks like a timeout in catapult to me, maybe? Adding some folks in case they see something.
,
Oct 17 2016
devil/pylib --> jbudorick, rnephew
,
Oct 17 2016
The Android test runner breaks the tests up into chunks of 256 and attempts to run each chunk. One of these chunks seems to be timing out:
E 185.066s run_tests_on_device(06b2c6d9003bfabf) Shard timed out: run_tests_on_device(06b2c6d9003bfabf)
Traceback (most recent call last):
  File "/b/swarm_slave/w/irRQvqQ0/build/android/pylib/local/device/local_device_environment.py", line 55, in wrapper
    return f(dev, *args, **kwargs)
  File "/b/swarm_slave/w/irRQvqQ0/build/android/pylib/local/device/local_device_test_run.py", line 62, in run_tests_on_device
    result = self._RunTest(dev, test)
  File "/b/swarm_slave/w/irRQvqQ0/build/android/pylib/local/device/local_device_gtest_run.py", line 363, in _RunTest
    timeout=timeout, retries=0)
  File "/b/swarm_slave/w/irRQvqQ0/build/android/pylib/local/device/local_device_gtest_run.py", line 154, in Run
    self._component, extras=extras, raw=False, **kwargs)
  File "/b/swarm_slave/w/irRQvqQ0/third_party/catapult/devil/devil/android/decorators.py", line 57, in timeout_retry_wrapper
    retry_if_func=retry_if_func)
  File "/b/swarm_slave/w/irRQvqQ0/third_party/catapult/devil/devil/utils/timeout_retry.py", line 159, in Run
    error_log_func=error_log_func)
  File "/b/swarm_slave/w/irRQvqQ0/third_party/catapult/devil/devil/utils/reraiser_thread.py", line 186, in JoinAll
    self._JoinAll(watcher, timeout)
  File "/b/swarm_slave/w/irRQvqQ0/third_party/catapult/devil/devil/utils/reraiser_thread.py", line 151, in _JoinAll
    (len(alive_threads), len(self._threads)))
CommandTimeoutError: Timed out waiting for 1 of 1 threads.
This means either one or more of the tests are taking longer than they once did, or a test that takes a long time has been added.
We can do several things to try to fix this: increase the timeout, find out which CL made the tests take longer (probably the hardest one to do), or decrease the number of tests per shard.
John is out today, so adding mikecase@ as well.
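To illustrate why a smaller chunk size can get a chunk back under the timeout, here is a rough sketch. It is not the actual pylib code: `chunk_tests`, `chunks_over_budget`, and the timings are all hypothetical, and it assumes a chunk's run time is simply the sum of its per-test durations, checked against the 120 s timeout seen in the log.

```python
def chunk_tests(tests, chunk_size=256):
    """Split a list into consecutive chunks of at most chunk_size items."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [tests[i:i + chunk_size] for i in range(0, len(tests), chunk_size)]


def chunks_over_budget(durations_s, chunk_size, timeout_s):
    """Return indices of chunks whose summed test time exceeds timeout_s.

    durations_s holds per-test durations in seconds, in run order.
    This is an assumption-laden model: real runs also pay setup overhead.
    """
    chunks = chunk_tests(durations_s, chunk_size)
    return [i for i, chunk in enumerate(chunks) if sum(chunk) > timeout_s]


if __name__ == "__main__":
    # Hypothetical timings: 600 tests at 0.4 s each, one of which
    # (position 300) has regressed to 60 s.
    durations = [0.4] * 600
    durations[300] = 60.0
    print(chunks_over_budget(durations, chunk_size=256, timeout_s=120.0))
    print(chunks_over_budget(durations, chunk_size=128, timeout_s=120.0))
```

With these made-up numbers the chunk containing the slow test goes over budget at 256 tests per chunk but not at 128. That only hides the slow test rather than fixing it.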
,
Oct 18 2016
The right thing to do here is the hard one: figure out what's taking so long and fix it. (and perhaps try to make it easier to identify such issues)
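One possible way to make such issues easier to spot is to rank tests by the per-test durations gtest prints in its result lines (for example `[       OK ] Suite.Test (12 ms)`). The sketch below is only an illustration, not an existing tool: `slowest_tests` and the test names in the sample log are hypothetical.

```python
import re

# Matches gtest result lines such as "[       OK ] Suite.Test (12 ms)".
# search() is used so a leading log prefix does not prevent a match.
_RESULT_RE = re.compile(r"\[\s+(?:OK|FAILED)\s+\]\s+(\S+)\s+\((\d+) ms\)")


def slowest_tests(log_text, top_n=5):
    """Return up to top_n (test_name, milliseconds) pairs, slowest first."""
    timings = []
    for line in log_text.splitlines():
        match = _RESULT_RE.search(line)
        if match:
            timings.append((match.group(1), int(match.group(2))))
    timings.sort(key=lambda item: item[1], reverse=True)
    return timings[:top_n]


if __name__ == "__main__":
    # Hypothetical log excerpt; the test names are made up.
    sample = "\n".join([
        "[ RUN      ] FooTest.Fast",
        "[       OK ] FooTest.Fast (3 ms)",
        "[ RUN      ] FooTest.Slow",
        "[       OK ] FooTest.Slow (45000 ms)",
        "[ RUN      ] BarTest.Broken",
        "[  FAILED  ] BarTest.Broken (120 ms)",
    ])
    for name, ms in slowest_tests(sample, top_n=2):
        print(name, ms)
```

Comparing this ranking between a known-good run and a flaky run would be one way to narrow down which test got slower.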
,
Oct 21 2016
None of the last 200 runs have red content_unittests steps. Is this still an issue, and if it is, is it still a P1 issue?
,
Oct 24 2016
Removing Infra-Troopers since it hasn't happened since the 18th.
,
May 12 2017
Detected 3 new flakes for test/step "content_unittests (with patch)". To see the actual flakes, please visit https://chromium-try-flakes.appspot.com/all_flake_occurrences?key=ahVzfmNocm9taXVtLXRyeS1mbGFrZXNyKQsSBUZsYWtlIh5jb250ZW50X3VuaXR0ZXN0cyAod2l0aCBwYXRjaCkM. This message was posted automatically by the chromium-try-flakes app. Since flakiness is ongoing, the issue was moved back into Sheriff Bug Queue (unless already there).
,
May 15 2017
The latest failures are "shard #0 expired, not enough capacity"
Comment 1 by kulshin@chromium.org, Oct 13 2016
Status: WontFix (was: Untriaged)