youtube image
From YouTube: War Stories with Kubernetes in Production

Description

Rudderstack is an (open source) data pipeline company that helps send data from any source to any destination. In this talk, We want to start by explaining the Rudderstack architecture which is used to send data from any source to any destination that is similar to Kubernetes (ControlPlane/DataPlane), and how we deploy everything on Kubernetes using aws EKS. Following this focus is on giving a few case studies related to our incidents with Kubernetes and what actions we took to prevent the same to happen in the future.
Some examples are mentioned below: 1. How we took down the Kubernetes cluster by deleting config map 'aws-auth' that manages complete access management with EKS. Explain how we prevented this from happening again using the OPA gatekeeper. 2. How the EC2 worker node going down caused some downtime. Explain our runbook and how we improved the time to recover. We like to cover 3-4 such cases with our timeline of recovery and what action we took to prevent this from happening or improve recovery time.