From YouTube: Large Scale Batch Processing with Argo Work... Rakesh Subramanian Suresh & Saravanan Balasubramanian

Description

Large Scale Batch Processing with Argo Workflows and Events - Rakesh Subramanian Suresh & Saravanan Balasubramanian, Intuit

Argo Workflows is a Kubernetes-native workflow engine and a good fit for batch processing jobs running in Kubernetes. At Intuit, our Batch Processing Platform manages ~40,000 data processing pipelines using Argo Workflows, and 10% of these pipelines run concurrently every day. All of these pipelines are scheduled or triggered in an event-driven fashion using Argo Events: workflow executions are based on calendar events, or on a logical combination of calendar events, the status of upstream pipeline workflows, or events in data such as file or database updates, REST calls, etc. In this session, we will discuss how we run 40K pipeline workflows in Kubernetes clusters, share how we use Argo Events to manage ~400,000 different sorts of events for pipeline workflows, and explain how we achieve exactly-once triggering.
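To make the idea of triggering on a logical combination of events concrete, here is a minimal Python sketch. It is not Intuit's implementation and does not use the Argo Events API. The class `PipelineTrigger`, its fields, and the event source names are all hypothetical. It assumes each event carries a unique ID, requires every listed source to report (a logical AND), and ignores redelivered events so that a duplicate cannot fire the pipeline twice.

```python
from dataclasses import dataclass, field


@dataclass
class PipelineTrigger:
    """Hypothetical trigger: fires once when all required sources have reported."""

    pipeline: str
    required: frozenset  # event sources that must all arrive (logical AND)
    seen_event_ids: set = field(default_factory=set)
    satisfied: set = field(default_factory=set)
    fired: list = field(default_factory=list)

    def on_event(self, event_id: str, source: str) -> bool:
        """Record an event; return True only if it causes the pipeline to fire."""
        # Deduplicate: a redelivered event must not count a second time.
        if event_id in self.seen_event_ids:
            return False
        self.seen_event_ids.add(event_id)

        # Ignore sources this pipeline does not depend on.
        if source not in self.required:
            return False
        self.satisfied.add(source)

        # Wait until every required source has reported.
        if not self.required <= self.satisfied:
            return False

        # All dependencies met: fire once, then reset for the next cycle.
        self.fired.append(self.pipeline)
        self.satisfied.clear()
        return True


trigger = PipelineTrigger("daily-report", frozenset({"calendar", "upstream-done"}))
trigger.on_event("e1", "calendar")       # waiting on upstream-done
trigger.on_event("e1", "calendar")       # duplicate delivery, ignored
trigger.on_event("e2", "upstream-done")  # both sources present, fires
trigger.on_event("e2", "upstream-done")  # duplicate delivery, ignored
print(trigger.fired)
```

The deduplication here lives in process memory, so it only illustrates the logic. A production system would presumably need durable storage for seen event IDs and trigger state to keep the guarantee across restarts.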