[kefka] paygen_au tests regularly lose ssh connection
Issue description
Skate and kefka paygen_au tests are failing continuously on every stable RC. No luck even after 5 or 6 reruns.
GE: https://cros-goldeneye.corp.google.com/chromeos/console/qaRelease?releaseName=M64-STABLE-CHROMEOS-6
Kefka: https://ubercautotest.corp.google.com/new_tko/#tab_id=spreadsheet_view&row=job_name%252Ctest_name%252Cplatform&column=status&show_incomplete=true&show_only_latest=false&show_invalid=true&condition=job_name%2520LIKE%2520%27kefka-release/R64-10176.76.0/paygen%255C_au%255C_stable/%2525%27%2520AND%2520test_name%2520%253C%253E%2520%27SERVER_JOB%27%2520AND%2520test_name%2520NOT%2520LIKE%2520%27CLIENT%255C_JOB%2525%27%2520AND%2520platform%2520LIKE%2520%27kefka%27
Skate: https://ubercautotest.corp.google.com/new_tko/#tab_id=spreadsheet_view&row=job_name%252Ctest_name%252Cplatform&column=status&show_incomplete=true&show_only_latest=false&show_invalid=true&condition=job_name%2520LIKE%2520%27daisy%255C_skate-release/R64-10176.76.0/paygen%255C_au%255C_stable/%2525%27%2520AND%2520test_name%2520%253C%253E%2520%27SERVER_JOB%27%2520AND%2520test_name%2520NOT%2520LIKE%2520%27CLIENT%255C_JOB%2525%27%2520AND%2520platform%2520LIKE%2520%27daisy_skate%27
,
Feb 27 2018
We are passing kefka and skate in https://cros-goldeneye.corp.google.com/chromeos/console/qaRelease?releaseName=M64-STABLE-CHROMEOS-6. Is that OK?
,
Feb 27 2018
Manual tests passed. Good to go now.
,
Dec 11
kefka still fails regularly with ssh connection issues. It happens at all stages of the update (before it starts, after rebooting, before stateful, after stateful).
,
Dec 11
Assigning to infra deputy.
https://stainless.corp.google.com/search?view=matrix&row=board_model&col=build&first_date=2018-12-06&last_date=2018-12-10&suite=%5Epaygen_au_dev&board=kefka&exclude_cts=false&exclude_not_run=false&exclude_non_release=true&exclude_au=false&exclude_acts=true&exclude_retried=true&exclude_non_production=false
,
Dec 13
Checking kefka status:
6 bvt 1-Provisioning 0-Ready 0-Repair Failed 0-Repairing 2-Resetting 3-Running 0-Locked
28 cts 1-Provisioning 0-Ready 0-Repair Failed 4-Repairing 2-Resetting 21-Running 0-Locked
1 cts-perbuild 0-Provisioning 0-Ready 0-Repair Failed 1-Repairing 0-Resetting 0-Running 0-Locked
15 suites 0-Provisioning 7-Ready 4-Repair Failed 4-Repairing 0-Resetting 0-Running 5-Locked
4 of the 5 locked are pending replacement: https://screenshot.googleplex.com/O2e0kAi6BeL
With 4 repair failed and 4 repairing, that leaves 7 ready, which should be sufficient for suites. We might just check and nuke the 8 units that are under repair and see what's happening.
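The pool arithmetic in the status dump above can be double-checked with a small sketch. The counts are copied from this comment; the `POOLS` dict layout and the `summarize` helper are hypothetical names used only for illustration, not part of any lab tooling.

```python
# Counts copied from the kefka status dump in this comment.
# The dict layout and helper name are assumptions for illustration.
POOLS = {
    "bvt": {"Provisioning": 1, "Ready": 0, "Repair Failed": 0,
            "Repairing": 0, "Resetting": 2, "Running": 3, "Locked": 0},
    "cts": {"Provisioning": 1, "Ready": 0, "Repair Failed": 0,
            "Repairing": 4, "Resetting": 2, "Running": 21, "Locked": 0},
    "cts-perbuild": {"Provisioning": 0, "Ready": 0, "Repair Failed": 0,
                     "Repairing": 1, "Resetting": 0, "Running": 0, "Locked": 0},
    "suites": {"Provisioning": 0, "Ready": 7, "Repair Failed": 4,
               "Repairing": 4, "Resetting": 0, "Running": 0, "Locked": 5},
}


def summarize(pools):
    """Return ready / under-repair / locked counts for each pool."""
    summary = {}
    for name, counts in pools.items():
        summary[name] = {
            "ready": counts["Ready"],
            # "Under repair" here means Repair Failed plus Repairing.
            "under_repair": counts["Repair Failed"] + counts["Repairing"],
            "locked": counts["Locked"],
        }
    return summary


if __name__ == "__main__":
    for pool, row in summarize(POOLS).items():
        print(pool, row)
```

For the suites pool this gives 7 ready, 8 under repair, and 5 locked, matching the numbers quoted above.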
,
Dec 13
Regarding c#6: I was talking about the suites pool when I mentioned the locked and repair devices.
,
Dec 13
,
Dec 14
Note: blocking wizpig from M71 stable
,
Dec 14
Ping since this is blocking wizpig from getting tested and included in the M71 stable; thanks! Is the owner OOO as monorail is calling out?
,
Dec 14
,
Jan 12
Issue 776596 has been merged into this issue.
,
Jan 12
Assigned to next infra deputy.
,
Jan 15
,
Jan 16
,
Jan 16
Aviv is deputy this week.
,
Jan 18
Is there any evidence that this is a lab infra issue as opposed to flakiness in kefka itself?
Comment 1 by dhadd...@chromium.org
,
Feb 26 2018
Status: Assigned (was: Untriaged)