youtube image
From YouTube: Reducing Mean-Time-to-Detection of Incidents with an Envoy Service Mesh - Constance Caramanolis

Description

Join us for Kubernetes Forums Seoul, Sydney, Bengaluru and Delhi - learn more at kubecon.io

Don't miss KubeCon + CloudNativeCon 2020 events in Amsterdam March 30 - April 2, Shanghai July 28-30 and Boston November 17-20! Learn more at kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects

Reducing Mean-Time-to-Detection of Incidents with an Envoy Service Mesh - Constance Caramanolis, Lyft

Incident management is inherently stressful and is made worse when the diagnostics and observability data is lacking and heterogenous. Lyft runs Envoy at every hop of the network providing best in class observability across the entirety of Lyft’s network topology. Homogenous data reduces the time it takes to identify production issues. This talk will simulate a production incident at Lyft and guide the attendees through a page from the dreaded PagerDuty notification to resolution, by showing how engineers use Envoy’s extensive observability to identify and root cause the incident and remedy the situation, thus reducing mean time to resolution.

To learn more: https://sched.co/GraX