Our users are Chrome developers. Sometimes, our infrastructure fails them. When that happens, it would be nice if we had somewhere they could go when they’re wondering “Is this service down for everyone or just me?” We would like our eng resident to build a status dashboard displaying indicators of the health of each of our services. This project will involve collaborating with Chrome Operations developers and Site Reliability Engineers. Experience with Go, web development, or monitoring is a plus but is not required.
Another important feature: if a service is red, the dashboard should show whether someone is already working on it. This might be accomplished by surfacing P0 issues in the trooper queue via the Monorail API.
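As a rough illustration of the idea above, here is a minimal sketch of how a dashboard row could combine a health indicator with open P0 issues. Everything in it is hypothetical: the `Issue` fields, the `ServiceStatus` type, and `build_dashboard` are made up for illustration and are not the Monorail API or its schema. It also assumes that a P0 issue with an owner means someone is working on the outage, which the team may define differently.

```python
from dataclasses import dataclass, field
from enum import Enum


class Health(Enum):
    GREEN = "green"
    RED = "red"


@dataclass
class Issue:
    # Hypothetical, simplified view of a tracker issue (not the Monorail schema).
    id: int
    priority: int      # 0 means P0
    service: str       # service the issue is filed against
    owner: str = ""    # empty when nobody has picked the issue up


@dataclass
class ServiceStatus:
    name: str
    health: Health
    p0_issues: list = field(default_factory=list)

    @property
    def being_worked_on(self) -> bool:
        # Assumption: an owned P0 issue means someone is on it.
        return any(issue.owner for issue in self.p0_issues)


def build_dashboard(health_by_service, issues):
    """Return one ServiceStatus per service, sorted by service name.

    P0 issues are attached only to services that are currently red.
    """
    rows = []
    for name, health in sorted(health_by_service.items()):
        p0s = []
        if health is Health.RED:
            p0s = [i for i in issues if i.service == name and i.priority == 0]
        rows.append(ServiceStatus(name, health, p0s))
    return rows


if __name__ == "__main__":
    # Made-up service names and issues, for illustration only.
    health = {"service-a": Health.RED, "service-b": Health.GREEN}
    queue = [Issue(1, 0, "service-a", owner="trooper"), Issue(2, 1, "service-a")]
    for row in build_dashboard(health, queue):
        print(row.name, row.health.value, row.being_worked_on)
```

In a real implementation the `issues` list would come from a query against the trooper queue, and the health values from whatever monitoring the services already have; both are left out here on purpose.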
We're hoping to get an eng resident to help out with this project.
Comment 1 by katthomas@google.com, Mar 13 2017
Status: Started (was: Available)