Back To Schedule
Monday, November 18 • 3:40pm - 4:10pm
Reliable Observability at Scale: Error Budgets for 1,000+ - Fred Moyer, Zendesk

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
"Observability and reliability engineering have been on a convergent course for several years. Error Budgets joined the reliability lexicon of engineering organizations in 2016 with the release of the SRE book. The intersection of observability and reliability has largely been the domain of specialists for practical implementation. How can one democratize these techniques to put them in the hands of a thousand engineers at once?

At Zendesk we developed simple algorithms and practical approaches for implementing SLIs, SLOs, and Error Budgets at scale using a number of observability tools. This talk will show the approaches developed and how we were able to manage observability instrumentation across dozens of teams quickly in a complex ecosystem (CDN, UI, middleware, backend, queues, dbs, queues, etc)."

avatar for Fred Moyer

Fred Moyer

Staff Site Reliability Engineer, Zendesk
SLOgician, bitmasks&, C/Perl/Ruby/Go/blablabla. Staff SRE at Zendesk. Likes TSDBs, operational telemetry, mountain biking, high cardinality. Previously Circonus and Turnitin. 2018 Google Istio Developer award, 2013 Perl White Camel award.

Monday November 18, 2019 3:40pm - 4:10pm PST
Marriott Marquis San Diego Marina - San Diego Room B/C
  • Session Slides Included Yes