New issue
Advanced search Search tips
Note: Color blocks (like or ) mean that a user may not be available. Tooltip shows the reason.

Issue 739428 link

Starred by 3 users

Issue metadata

Status: Fixed
Owner:
Last visit > 30 days ago
Closed: Jul 2017
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 1
Type: Bug

Blocking:
issue 729119



Sign in to add a comment

devserver "health check" alert firing for devservers across the board

Project Member Reported by akes...@chromium.org, Jul 5 2017

Issue description

Alerts firing for a wide variety of devservers. Problems:

1) It is too hard to identify from the alerts that have an IP address which devserver is responsible.

2) The alert links to a dashboard that shows failure rate (per minute) rather than failure fraction, which is what is actually alerted on.

I'm not yet sure if the alerts are meaningful. Filing this for now for follow-up.
 
Blocking: 729119
Issue 739904 has been merged into this issue.
Cc: krisr@chromium.org android-comms---labs@google.com
Owner: dschimmels@chromium.org

Comment 5 by jashur@chromium.org, Jul 10 2017

Owner: jashur@chromium.org
Status: Assigned (was: Untriaged)
I took care of this. We were waiting for a cl to go through over the weekend to remove the old vmware (dup) mac address. The devservers are now online and you should no longer see any faults.

jashur@jashur:~$ ping chromeos9-infra-devserver
PING chromeos9-infra-devserver.cros.corp.google.com (100.115.99.252) 56(84) bytes of data.
64 bytes from 100.115.99.252: icmp_seq=1 ttl=62 time=0.543 ms
64 bytes from 100.115.99.252: icmp_seq=2 ttl=62 time=0.514 ms
64 bytes from 100.115.99.252: icmp_seq=3 ttl=62 time=0.517 ms

jashur@jashur:~$ ping chromeos9-infra-devserver1
PING chromeos9-infra-devserver1.cros.corp.google.com (100.115.99.251) 56(84) bytes of data.
64 bytes from 100.115.99.251: icmp_seq=1 ttl=62 time=0.645 ms
64 bytes from 100.115.99.251: icmp_seq=2 ttl=62 time=0.649 ms
64 bytes from 100.115.99.251: icmp_seq=3 ttl=62 time=0.662 ms

Comment 6 by jashur@chromium.org, Jul 10 2017

Status: Fixed (was: Assigned)

Sign in to add a comment