Issue 857512

Issue metadata

Status: Assigned
Owner:
Cc:
Components:
EstimatedDays: ----
NextAction: ----
OS: ----
Pri: 1
Type: Bug



Fix current issues that cause Milo to regularly respond with 500 on user-facing endpoints

Project Member Reported by qyears...@chromium.org, Jun 28 2018

Issue description

Yesterday LuciMiloHTML5xxRateHigh fired. It appears that if it were enabled as a paging alert now, it would likely fire regularly for several routes on Milo.

Before making this a paging alert, we should resolve the current issues. "Resolving issues" may in some cases mean returning a 4xx response instead of a 5xx.

Some such endpoints may include:

/b/:id
/p/:project/builders/:bucket/:builder/:numberOrId
/p/:project/g/:group/console
/swarming/task/:id/steps/*logname

The alert yesterday:
https://groups.google.com/a/google.com/forum/#!topic/chops-foundation-alerts/cYGbdRi10nQ

 
Cc: efoo@chromium.org estaab@chromium.org
+ estaab/efoo to track and triage
I've silenced the alert because it produces too much spam. Please unsilence or adjust the alert if you disagree: http://silence/2749219513504366592

Here is the graph of human-visible HTTP 500s: http://shortn/_pcmZqsi7sF

Labels: -Pri-2 Pri-1
Owner: hinoka@chromium.org
Status: Assigned (was: Available)
500s should be rare and indicate real problems. I looked through pantheon and saw a bunch of "console not found" errors for the /p/:project/g/:group/console URLs that are being reported as 500s instead of 404s, as well as a gitiles timeout and a request timeout. It would be good to clean these up so we can see when something is actually broken.
