BVT CQs consistently aborting on gnawty board for R50-7978.74.0 |
||
Issue descriptionDescribe problem with existing test, or request for new test. The gnawty-release/R50-7978.74.0/bvt-cq tests are getting aborted across the board. please navigate to cautotest/ and filter to owner=ketakid and Name=gnawty-release/R50-7978.74.0/bvt-cq and click filter. You should all tests either aborted or queued. There were at least 3-4 attempts made to re-run these tests but they are still not succeeding. I don't have any logs to go off. We need to investigate what is going on here. Adding sheriffs and deputy. Set Cr-BVT-CQ Set Test-? to type of test.
,
May 13 2016
The first of those two seems to be running just fine. http://cautotest/afe/#tab_id=view_job&object_id=63212560 It still has some queued jobs. I notice that in its debug log (https://pantheon.corp.google.com/m/cloudstorage/b/chromeos-autotest-results/o/63212560-ketakid/hostless/debug/autoserv.DEBUG) that it was run in a mode where the suite job doesn't wait on child jobs. Was this intentional? http://cautotest/afe/#tab_id=view_job&object_id=63109569 (job log: https://pantheon.corp.google.com/m/cloudstorage/b/chromeos-autotest-results/o/63109569-ketakid/hostless/debug/autoserv.DEBUG) I see that its child jobs were all aborted. Picking an arbitrary one http://cautotest/afe/#tab_id=view_job&object_id=63109573 seems like it was aborted before ever starting, my guess is due to timeout.
,
May 13 2016
What exact command line was used to launch these suites?
,
May 13 2016
I guess the last (4th... or so) re-run of the suite succeeded. If you look back at the previous runs they were aborting across the board. This is the script we use to re-run entire suites. For individual tests we use catotest/ for BOARD in $BOARDS ; do /usr/local/google/home/chromeos-re/paygen/buildtools/src/third_party/autotest/files/site_utils/run_suite.py --board $BOARD --build $BOARD-release/R48-7647.70.0 --suite_name bvt-cq --file_bugs True --pool bvt --no_wait True --priority CQ --timeout_mins 600 --retry True --num 6; done
,
May 13 2016
You shouldn't use --no_wait, that causes the suite job to end without waiting for results, which is why the suite job logs in #2 are uninformative. If all you want is for run_suite to return immediately, use -c instead.
,
May 13 2016
With only the suite job logs that were run with "--no_wait" this is basically impossible to diagnose in detail. |
||
►
Sign in to add a comment |
||
Comment 1 by akes...@chromium.org
, May 13 2016