youtube image
From YouTube: Stories from the Playbook - Tina Zhang & Fred van den Driessche, Google (Any Skill Level)

Description

Want to view more sessions and keep the conversations going? Join us for KubeCon + CloudNativeCon North America in Seattle, December 11 - 13, 2018 (http://bit.ly/KCCNCNA18) or in Shanghai, November 14-15 (http://bit.ly/kccncchina18).

Stories from the Playbook - Tina Zhang & Fred van den Driessche, Google (Any Skill Level)

Have you ever wondered how GKE Site Reliability Engineers (SRE) manage an entire fleet of GKE clusters in 15 regions around the world? This talk provides an overview on how the SRE team approach this challenge, what tools are used, the problems encountered and war stories/learning experiences. The talk introduces the most frequently used parts of our playbook and how SRE endeavours to save your cluster while oncall in an effort to meet our SLOs.

"About Tina
Tina joined the Google as a Site Reliability Engineer for GKE in March 2017 and has primarily been working on delivering High Availability Masters in GKE, bringing GKE to more cloud regions and improving monitoring and alerting for the system. Prior to this, she had a previous life as an investment banker at J.P. Morgan. Previous tech speaking experience: Kubecon (US) 2017: What Happens When Something Goes Wrong? On Kubernetes Reliability (co-speaker with Marek Grabowski)

About Fred
Fred is an SRE at Google working on Google Kubernetes Engine, primarily focused on improving system observability, both at single cluster and fleet-wide levels. Previously he worked at Microsoft, writing and wrangling Java web apps for their Yammer product.
Join us for KubeCon + CloudNativeCon in Barcelona May 20 - 23, Shanghai June 24 - 26, and San Diego November 18 - 21! Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy and all of the other CNCF-hosted projects.