Retry a build if it failed to start cbuildbot |
||||||||
Issue descriptionhttps://uberchromegw.corp.google.com/i/chromeos/builders/master-paladin/builds/12569 This master failed because amd64-generic-paladin did not start. https://build.chromium.org/p/chromiumos/builders/amd64-generic-paladin/builds/27404 amd64-generic-paladin didn't start because it failed at bot-update stage before it reached the cbuildbot stage. One improvement we can do is to use buildbucket to retry this build. Ideas?
,
Oct 11 2016
,
Nov 8 2016
The following revision refers to this bug: https://chromium.googlesource.com/chromiumos/chromite/+/30c14d78eada0c9e72911f03a32dca1786ac16a3 commit 30c14d78eada0c9e72911f03a32dca1786ac16a3 Author: Ningning Xia <nxia@chromium.org> Date: Tue Nov 01 23:30:52 2016 Add unscheduled_slaves metadata. Add unschduled slaves to unscheduled_slaves metadata. 1) record unscheduled slaves. 2) can reschedule the slaves when retrying builds are implemented. BUG= chromium:654481 TEST=unit_tests Change-Id: I95b12b3283f40e7514c0f9ad86cd6ac20e15576d Reviewed-on: https://chromium-review.googlesource.com/406488 Commit-Ready: Ningning Xia <nxia@chromium.org> Tested-by: Ningning Xia <nxia@chromium.org> Reviewed-by: Paul Hobbs <phobbs@google.com> [modify] https://crrev.com/30c14d78eada0c9e72911f03a32dca1786ac16a3/cbuildbot/stages/sync_stages_unittest.py [modify] https://crrev.com/30c14d78eada0c9e72911f03a32dca1786ac16a3/cbuildbot/stages/sync_stages.py
,
Nov 8 2016
The following revision refers to this bug: https://chromium.googlesource.com/chromiumos/chromite/+/5fe75da17c126a41a894efdf1c539491c63b9743 commit 5fe75da17c126a41a894efdf1c539491c63b9743 Author: Ningning Xia <nxia@chromium.org> Date: Thu Nov 03 22:57:01 2016 Add RetryBuildRequest in buildbucket_lib. To support build retry using Buildbucket, add RetryBuildRequest in buildbucket_lib. BUG= chromium:654481 TEST=unit_tests Change-Id: I1c9c0abc9371982583b5922b3b51d8a5567b4a81 Reviewed-on: https://chromium-review.googlesource.com/407981 Commit-Ready: Ningning Xia <nxia@chromium.org> Tested-by: Ningning Xia <nxia@chromium.org> Reviewed-by: Aviv Keshet <akeshet@chromium.org> [modify] https://crrev.com/5fe75da17c126a41a894efdf1c539491c63b9743/cbuildbot/buildbucket_lib_unittest.py [modify] https://crrev.com/5fe75da17c126a41a894efdf1c539491c63b9743/cbuildbot/buildbucket_lib.py
,
Dec 3 2016
The following revision refers to this bug: https://chromium.googlesource.com/chromiumos/chromite/+/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d commit 44b887e0c055f66c81ead8c0d61e91b0e8d9b99d Author: Ningning Xia <nxia@chromium.org> Date: Tue Nov 29 00:29:11 2016 Pass config and metadata to BuildSpecsManager. BuildSpecsManager will need to get build statuses from Buildbucket and retry builds; meanwhile, it needs to update the 'scheduled_slaves' metadata with the new buildbucket_id and create_ts. Pass config (config.name determines whether to get slave statues from Buildbucket) and metadata to BuildSpecsManager. BUG= chromium:654481 TEST=unit_tests Change-Id: Ied7913a54561a1f869b1f1e1be7add587de338e2 Reviewed-on: https://chromium-review.googlesource.com/414235 Commit-Ready: Ningning Xia <nxia@chromium.org> Tested-by: Ningning Xia <nxia@chromium.org> Reviewed-by: Aviv Keshet <akeshet@chromium.org> [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/lib/metadata_lib.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/manifest_version_unittest.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/stages/completion_stages.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/manifest_version.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/stages/sync_stages.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/stages/completion_stages_unittest.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/buildbucket_lib.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/buildbucket_lib_unittest.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/lib/constants.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/cbuildbot/lkgm_manager.py [modify] https://crrev.com/44b887e0c055f66c81ead8c0d61e91b0e8d9b99d/lib/metadata_lib_unittest.py
,
Dec 3 2016
The following revision refers to this bug: https://chromium.googlesource.com/chromiumos/chromite/+/4904985f59d4ea9a9ce6a07290474838a293385c commit 4904985f59d4ea9a9ce6a07290474838a293385c Author: Ningning Xia <nxia@chromium.org> Date: Wed Nov 23 00:50:37 2016 GetScheduledBuildDict returns build->build_info map. Previously GetScheduledBuildDict only returned build->buildbucket_id map. In order to support retry build with limit times, keep track of the retry times and return build->build_info map. BUG= chromium:654481 TEST=unit_tests Change-Id: I4f8a639a7704fb07325b996c8058779b7011edcc Reviewed-on: https://chromium-review.googlesource.com/414409 Commit-Ready: Ningning Xia <nxia@chromium.org> Tested-by: Ningning Xia <nxia@chromium.org> Reviewed-by: Aviv Keshet <akeshet@chromium.org> [modify] https://crrev.com/4904985f59d4ea9a9ce6a07290474838a293385c/cbuildbot/manifest_version_unittest.py [modify] https://crrev.com/4904985f59d4ea9a9ce6a07290474838a293385c/cbuildbot/stages/completion_stages.py [modify] https://crrev.com/4904985f59d4ea9a9ce6a07290474838a293385c/cbuildbot/manifest_version.py [modify] https://crrev.com/4904985f59d4ea9a9ce6a07290474838a293385c/cbuildbot/stages/completion_stages_unittest.py [modify] https://crrev.com/4904985f59d4ea9a9ce6a07290474838a293385c/cbuildbot/buildbucket_lib_unittest.py [modify] https://crrev.com/4904985f59d4ea9a9ce6a07290474838a293385c/cbuildbot/buildbucket_lib.py
,
Dec 6 2016
The following revision refers to this bug: https://chromium.googlesource.com/chromiumos/chromite/+/f8559ead0dd7f71d003823c760ca190ddd9e7f24 commit f8559ead0dd7f71d003823c760ca190ddd9e7f24 Author: Ningning Xia <nxia@chromium.org> Date: Thu Dec 01 00:12:54 2016 Retry builds which failed before Cbuildbot step. For builds which fail before reaching the BuildStartStage, no corresponding build_ids would be inserted into CIDB. We can safely retry those failed builds and update the 'scheduled_slaves' in the master metadata. BUG= chromium:654481 TEST=unit_tests Change-Id: Ida993c7abb90af008c0a7ff3d3e3c894d0c0934e Reviewed-on: https://chromium-review.googlesource.com/415556 Commit-Ready: Ningning Xia <nxia@chromium.org> Tested-by: Ningning Xia <nxia@chromium.org> Reviewed-by: Ningning Xia <nxia@chromium.org> [modify] https://crrev.com/f8559ead0dd7f71d003823c760ca190ddd9e7f24/lib/constants.py [modify] https://crrev.com/f8559ead0dd7f71d003823c760ca190ddd9e7f24/cbuildbot/manifest_version.py [modify] https://crrev.com/f8559ead0dd7f71d003823c760ca190ddd9e7f24/cbuildbot/manifest_version_unittest.py
,
Dec 14 2016
,
Jan 24 2017
An example of a successful retry. https://uberchromegw.corp.google.com/i/chromeos/builders/master-paladin/builds/13446/steps/CommitQueueCompletion/logs/stdio 09:56:41: INFO: Still waiting for the following builds to complete: ['beaglebone-paladin'] 09:56:41: INFO: Going to retry build beaglebone-paladin buildbucket_id 8989573633939369200 with retry # 1 09:56:41: INFO: Refreshing due to a 401 (attempt 1/2) 09:56:41: INFO: Refreshing access_token 09:56:42: INFO: Retried build beaglebone-paladin buildbucket_id 8989564123415132752 created_ts 1485280602305290 09:56:42: INFO: 1:55:59.605314 until timeout...
,
Mar 4 2017
,
Apr 17 2017
,
May 30 2017
,
Aug 1 2017
,
Oct 14 2017
|
||||||||
►
Sign in to add a comment |
||||||||
Comment 1 by dgarr...@chromium.org
, Oct 10 2016