New issue
Advanced search Search tips
Starred by 1 user

Issue metadata

Status: WontFix
Closed: Sep 2017
EstimatedDays: ----
NextAction: ----
OS: Chrome
Pri: 2
Type: Bug

Sign in to add a comment

Issue 766259: buildstart stage failing with IntegrityError

Reported by, Sep 18 2017 Project Member

Issue description

Chrome Version: ToT
OS: Chrome OS

All toolchain builders failed with: 
@@@STEP_LINK@Builder documentation@
10:01:42: INFO: Running cidb query on pid 19869, repr(query) starts with 'SELECT NOW()'
10:01:42: INFO: Running cidb query on pid 19869, repr(query) starts with <sqlalchemy.sql.expression.Insert object at 0x7f846a485590>
10:01:42: ERROR: Error: (IntegrityError) (1062, "Duplicate entry '8968103418167764992' for key 'buildbucket_id_index'") 'INSERT INTO `buildTable` (master_build_id, buildbot_generation, builder_name, waterfall, build_number, build_config, bot_hostname, start_time, deadline, important, buildbucket_id) VALUES (%s, %s, %s, %s, %s, %s, %s, CURRENT_TIMESTAMP, %s, %s, %s)' (1860550, 1, 'amd64-llvm-next-toolchain', 'chromeos', 388, u'amd64-llvm-next-toolchain', 'cros-beefy272-c2.c.chromeos-bot.internal', datetime.datetime(2017, 9, 19, 6, 53, 24), True, '8968103418167764992')
 If the buildbucket_id to insert is duplicated to the buildbucket_id of an old build and the old build was canceled because of a waterfall master restart, please ignore this error. Else, the error needs more investigation. More context:  and 

What steps will reproduce the problem?

Here is an example:


all the toolchain builders failed with that this morning.

I see the error message refers to this issue: 

And I can see that previous iteration of the builders was "interrupted" (purple color) so maybe I should be ignoring this error.

But, it does not sound right to ignore. The builder yesterday was "interrupted" and the one today failed because of this error. I don't think this should be expected behavior. It is just another day of testing that was not done. So, every interrupt on one day means a failure on the next day?

assigning to Sheriff (akeshet) for clarification.

Comment 1 by, Sep 18 2017

I believe there was a waterfall restart this morning. That may have caused buildbot to "forget" about one of its previous ongoing builds, which could have caused this issue.

If this happens again on the next build, warrants further investigation. Otherwise I believfe it should resolve on its own.

Comment 2 by, Sep 18 2017

Labels: -Pri-1 Pri-2

Comment 3 by, Sep 18 2017


Comment 4 by, Sep 25 2017

Status: WontFix (was: Assigned)
we can close this now, it was a corner case.

Comment 5 by, Oct 23 2017

Labels: Hotlist-CrOS-Sheriffing
This issue occurred again. All paladins failed or stopped at exception. The previous master-paladin build failed to clean up ( Maybe it's the reason.
Here is the example of the failure build:

I Will keep eyes on it if it's flaky failure or not.

Comment 6 by, Oct 23 2017

Re #5, yes it's a flaky failure.

Comment 7 by, Oct 23 2017

This happens (rarely) when buildbot forgets about a previous build and re-uses its buildbot #.

Comment 8 by, Jan 4 2018

Components: -Infra>Client>ChromeOS

Sign in to add a comment