2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Argo’s Vibrant Ecosystem and Community - Alex Collins, Intuit; Alexander Matyushentsev, Akuity; Dan Garfield, Codefresh
Argo CD is the most popular Kubernetes GitOps application delivery tool. Argo Workflows is the most popular Kubernetes workflow execution platform. I bet you know about the core Argo projects, but what about the massive rapidly growing ecosystem of projects around them? Have you heard of Argo Events, Argo Rollouts, ApplicationSet, Argo CD Image Updater, Argo CD Vault Plugin, Argo CD Autopilot, Hera Workflows? What about tools such as Kubeflow Pipelines, Katib, SQL Flow, Couler, and Ploomber?. In this session, you’ll learn not just about the core Argo projects, but also get the highlights from several ecosystem projects with demos and best practices. You can address many more use cases by teaming them up with each other.
Argo’s Vibrant Ecosystem and Community - Alex Collins, Intuit; Alexander Matyushentsev, Akuity; Dan Garfield, Codefresh
Argo CD is the most popular Kubernetes GitOps application delivery tool. Argo Workflows is the most popular Kubernetes workflow execution platform. I bet you know about the core Argo projects, but what about the massive rapidly growing ecosystem of projects around them? Have you heard of Argo Events, Argo Rollouts, ApplicationSet, Argo CD Image Updater, Argo CD Vault Plugin, Argo CD Autopilot, Hera Workflows? What about tools such as Kubeflow Pipelines, Katib, SQL Flow, Couler, and Ploomber?. In this session, you’ll learn not just about the core Argo projects, but also get the highlights from several ecosystem projects with demos and best practices. You can address many more use cases by teaming them up with each other.
- 5 participants
- 30 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Automated Progressive Delivery Using GitOps and Service Mesh - Yasen Simeonov & Henrik Blixt, Intuit
Intuit is rolling out progressive delivery together with service mesh, using metrics from the mesh to automate analysis on the health of a new release. This is our next step towards more sophisticated techniques like AI/ML-driven analysis. As a FinTech Technology company with tens of millions of users and thousands of services, with many being very seasonal, we were facing a wide variety of service performance and traffic patterns that made patterning analysis templates challenging. This session discusses and demos how Intuit leverages Argo Rollouts in combination with Envoy and Service Mesh to provide automatic and declarative progressive delivery. Attendees will learn about the challenges we faced, our learnings and work we had to overcome those challenges Intuit has been heavily involved in building tools for progressive analysis with Argo Rollouts and we are now rolling this out at large scale together with service mesh. This is the story on how and what worked in production.
Automated Progressive Delivery Using GitOps and Service Mesh - Yasen Simeonov & Henrik Blixt, Intuit
Intuit is rolling out progressive delivery together with service mesh, using metrics from the mesh to automate analysis on the health of a new release. This is our next step towards more sophisticated techniques like AI/ML-driven analysis. As a FinTech Technology company with tens of millions of users and thousands of services, with many being very seasonal, we were facing a wide variety of service performance and traffic patterns that made patterning analysis templates challenging. This session discusses and demos how Intuit leverages Argo Rollouts in combination with Envoy and Service Mesh to provide automatic and declarative progressive delivery. Attendees will learn about the challenges we faced, our learnings and work we had to overcome those challenges Intuit has been heavily involved in building tools for progressive analysis with Argo Rollouts and we are now rolling this out at large scale together with service mesh. This is the story on how and what worked in production.
- 8 participants
- 36 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Confidential Containers Explained - James Magowan, IBM & Samuel Ortiz, Apple
Confidential Containers (https://github.com/confidential-containers) enable cloud native confidential computing (https://confidentialcomputing.io/faq/) leveraging a variety of hardware platforms and technologies, standardising the confidential computing at the container level, helping users to deploy confidential workloads using already familiar workflows and tools; and in this Panel we're bringing together contributors from different hardware vendors, different projects (in different layers of the stack), and different companies to discuss and answer questions about this new complex technology. Topics for discussion will include: * How do we realise the benefits of cloud native confidential computing? * What impact is there to the Cloud Native User Experience? * What new considerations/concepts does confidential containers introduce? Note that this panel is a follow-up on what has been developed since https://www.youtube.com/watch?v=zTn9Xt1k1OA was presented.
Confidential Containers Explained - James Magowan, IBM & Samuel Ortiz, Apple
Confidential Containers (https://github.com/confidential-containers) enable cloud native confidential computing (https://confidentialcomputing.io/faq/) leveraging a variety of hardware platforms and technologies, standardising the confidential computing at the container level, helping users to deploy confidential workloads using already familiar workflows and tools; and in this Panel we're bringing together contributors from different hardware vendors, different projects (in different layers of the stack), and different companies to discuss and answer questions about this new complex technology. Topics for discussion will include: * How do we realise the benefits of cloud native confidential computing? * What impact is there to the Cloud Native User Experience? * What new considerations/concepts does confidential containers introduce? Note that this panel is a follow-up on what has been developed since https://www.youtube.com/watch?v=zTn9Xt1k1OA was presented.
- 6 participants
- 38 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Fun with Continuous Compliance - Ann Wallace, Shopify & Zeal Somani, Google
Is it possible to make compliance fun and less stressful? The old way of doing things is to manually gather evidence once or twice a year and hope nothing bad is found during your audit. This is not fun. In this talk, we’ll go over the concepts of continuous compliance and how to apply this to your current DevSecOps program. Zeal will talk about how the Open Security Controls Assessment Language (OSCAL) can be used to create automated control based assessments. Lastly, Ann will walk through how Shopify uses OSS like Falco and Voucher to achieve continuous compliance at scale. You will walk away from this session with information on how you can make compliance fun or at least less painful.
Fun with Continuous Compliance - Ann Wallace, Shopify & Zeal Somani, Google
Is it possible to make compliance fun and less stressful? The old way of doing things is to manually gather evidence once or twice a year and hope nothing bad is found during your audit. This is not fun. In this talk, we’ll go over the concepts of continuous compliance and how to apply this to your current DevSecOps program. Zeal will talk about how the Open Security Controls Assessment Language (OSCAL) can be used to create automated control based assessments. Lastly, Ann will walk through how Shopify uses OSS like Falco and Voucher to achieve continuous compliance at scale. You will walk away from this session with information on how you can make compliance fun or at least less painful.
- 7 participants
- 40 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
How Cookpad Leverages Triton Inference Server To Boost Their Model Serving - Jose Navarro & Prayana Galih, Cookpad
The adoption of MLOps practices and tooling by organizations has considerably reduced the pain points to productionise Machine Learning models. However, with the increase of the number of models available by a company to deploy, the diversity of frameworks used to train those models and the different infrastructure required to run each model, new challenges arise for Machine Learning Platform teams e.g: How can we deploy new models from the same or different frameworks concurrently? How can we improve throughput and optimize resource utilization in our serving infrastructure, especially GPUs? Cookpad ML Platform Engineers will talk in this session how Triton Inference Server, an open-source model serving tool from Nvidia, can simplify the process of model deployment and optimise the resource utilisation by efficiently supporting concurrent models on single GPU or CPU, and multi-GPU servers.
How Cookpad Leverages Triton Inference Server To Boost Their Model Serving - Jose Navarro & Prayana Galih, Cookpad
The adoption of MLOps practices and tooling by organizations has considerably reduced the pain points to productionise Machine Learning models. However, with the increase of the number of models available by a company to deploy, the diversity of frameworks used to train those models and the different infrastructure required to run each model, new challenges arise for Machine Learning Platform teams e.g: How can we deploy new models from the same or different frameworks concurrently? How can we improve throughput and optimize resource utilization in our serving infrastructure, especially GPUs? Cookpad ML Platform Engineers will talk in this session how Triton Inference Server, an open-source model serving tool from Nvidia, can simplify the process of model deployment and optimise the resource utilisation by efficiently supporting concurrent models on single GPU or CPU, and multi-GPU servers.
- 5 participants
- 32 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Real World SPIFFE Scenarios and Outcomes - Andres Vega & Frederick Kautz, SPIFFE Steering Committee
SPIFFE aims to strengthen the identification of software components in a common way that can be leveraged across distributed systems by anyone, anywhere. The ability to maintain software security by standardizing how systems define, attest, and maintain software identity, regardless of where systems are deployed or who deploys those systems, confers many benefits. The use of SPIFFE can significantly reduce costs associated with the overhead of managing and issuing cryptographic identity documents and accelerate development by removing the need for developers to understand the complexity involved to secure service-to-service communication, but that is not the only outcome. Production identity can have a positive impact on many areas such as interoperability, compliance, audibility, and more. This presentation demonstrates the real world scenarios and outcomes of deploying SPIFFE across your infrastructure and also using it to bridge and integrate the infrastructure of others.
Real World SPIFFE Scenarios and Outcomes - Andres Vega & Frederick Kautz, SPIFFE Steering Committee
SPIFFE aims to strengthen the identification of software components in a common way that can be leveraged across distributed systems by anyone, anywhere. The ability to maintain software security by standardizing how systems define, attest, and maintain software identity, regardless of where systems are deployed or who deploys those systems, confers many benefits. The use of SPIFFE can significantly reduce costs associated with the overhead of managing and issuing cryptographic identity documents and accelerate development by removing the need for developers to understand the complexity involved to secure service-to-service communication, but that is not the only outcome. Production identity can have a positive impact on many areas such as interoperability, compliance, audibility, and more. This presentation demonstrates the real world scenarios and outcomes of deploying SPIFFE across your infrastructure and also using it to bridge and integrate the infrastructure of others.
- 7 participants
- 49 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Sharing Knowledge: Writing Good Docs for Quick Approval - Jared Bhatti, Waymo
The goal of this talk is to increase your ability to write good documentation that gets approved quickly. Good documentation has a profound impact on the visibility, quality, and inclusivity of open source projects. Documentation creates a shared understanding of work, helps onboard new developers, and improves the overall quality and reliability of the project.
Based on Jared's experience leading Kubernetes SIG Docs from 2016 to 2020, this presentation walks developers through best practices for creating inclusive, accessible, high quality documentation in pull requests designed for quick approval. This demonstration includes how to structure documentation using content templates, write with clarity and technical accuracy, and avoid common pitfalls that trap PRs in prolonged reviews.
Sharing Knowledge: Writing Good Docs for Quick Approval - Jared Bhatti, Waymo
The goal of this talk is to increase your ability to write good documentation that gets approved quickly. Good documentation has a profound impact on the visibility, quality, and inclusivity of open source projects. Documentation creates a shared understanding of work, helps onboard new developers, and improves the overall quality and reliability of the project.
Based on Jared's experience leading Kubernetes SIG Docs from 2016 to 2020, this presentation walks developers through best practices for creating inclusive, accessible, high quality documentation in pull requests designed for quick approval. This demonstration includes how to structure documentation using content templates, write with clarity and technical accuracy, and avoid common pitfalls that trap PRs in prolonged reviews.
- 5 participants
- 38 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Unraveling the Magic Behind Buildpacks - Sambhav Kothari, Bloomberg & Natalie Arellano, VMware
Cloud Native Buildpacks makes building container images a breeze. It comes with out-of-the-box support for rebasing, reproducibility, multiple entrypoints and more! In this talk we’ll uncover the magic that the lifecycle - the binary at the heart of CNB - uses to convert source code into OCI images.
Unraveling the Magic Behind Buildpacks - Sambhav Kothari, Bloomberg & Natalie Arellano, VMware
Cloud Native Buildpacks makes building container images a breeze. It comes with out-of-the-box support for rebasing, reproducibility, multiple entrypoints and more! In this talk we’ll uncover the magic that the lifecycle - the binary at the heart of CNB - uses to convert source code into OCI images.
- 2 participants
- 39 minutes
2 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
containerd: Project Update and Deep Dive - Derek McGowan, Apple
Join containerd maintainers for an introduction and deep dive into the latest updates on containerd. With many exciting features currently in development, the upcoming release of container promises to deliver many new capabilities while retaining the stability containerd is known for amongst users. The deprecation of dockershim in Kubernetes has brought many new users to containerd along with greater need for documentation and tooling. We will cover how to get started and configure containerd for Kubernetes users. Additionally, nerdctl has filled a crucial usability gap for operators and developers coming to containerd. We will discuss how to make use of this important new containerd sub-project.
containerd: Project Update and Deep Dive - Derek McGowan, Apple
Join containerd maintainers for an introduction and deep dive into the latest updates on containerd. With many exciting features currently in development, the upcoming release of container promises to deliver many new capabilities while retaining the stability containerd is known for amongst users. The deprecation of dockershim in Kubernetes has brought many new users to containerd along with greater need for documentation and tooling. We will cover how to get started and configure containerd for Kubernetes users. Additionally, nerdctl has filled a crucial usability gap for operators and developers coming to containerd. We will discuss how to make use of this important new containerd sub-project.
- 1 participant
- 21 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Backstage: Restoring Order To Your Chaos - Dave Zolotusky, Spotify
The CNCF Landscape is a complex ecosystem of frameworks, technologies, and platforms. Your teams have their own implementations of these technologies. Onboarding new developers is super confusing, and docs are scattered around the place. In short: chaos. The solution to this might be… another portal? Backstage (https://backstage.io), a platform you can use to build your own developer portal. It is highly customisable and adopted by companies like Expedia, Netflix, American Airlines, and Epic Games. Dave will share why Backstage was developed at Spotify, how it became the core of their developer experience, and a CNCF project. You’ll get a quick tour of Backstage, the plugin ecosystem, and some of the key use cases for Backstage. Then he will dig into the Software Templates feature. How templates can help developers quickly get started with new code repositories. He will also cover how you can create custom templates to have your organizations best practices built-in, right from the start!
Backstage: Restoring Order To Your Chaos - Dave Zolotusky, Spotify
The CNCF Landscape is a complex ecosystem of frameworks, technologies, and platforms. Your teams have their own implementations of these technologies. Onboarding new developers is super confusing, and docs are scattered around the place. In short: chaos. The solution to this might be… another portal? Backstage (https://backstage.io), a platform you can use to build your own developer portal. It is highly customisable and adopted by companies like Expedia, Netflix, American Airlines, and Epic Games. Dave will share why Backstage was developed at Spotify, how it became the core of their developer experience, and a CNCF project. You’ll get a quick tour of Backstage, the plugin ecosystem, and some of the key use cases for Backstage. Then he will dig into the Software Templates feature. How templates can help developers quickly get started with new code repositories. He will also cover how you can create custom templates to have your organizations best practices built-in, right from the start!
- 4 participants
- 35 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
CNCF TAG-Runtime: Cloud Native Open Source Core Components - Alex Scammon, G-Research; Zbynek Roubalik, Red Hat; Ricardo Aravena, Rakuten; Samuel Ortiz, Apple
Learn about the CNCF open source projects that allow users to run cloud native workloads! This session will cover: 1) Overview of the TAG-Runtime, how to join, and how to get involved. 2) Update of working groups (new, existing, and potential) within the scope of the TAG . 3) How the TAG provides advise to the CNCF TOC. 4) Future trends for cloud native runtime technologies in the TAG scope such as containers, Virtual Machines, Edge/MLOps and WebAssembly.
CNCF TAG-Runtime: Cloud Native Open Source Core Components - Alex Scammon, G-Research; Zbynek Roubalik, Red Hat; Ricardo Aravena, Rakuten; Samuel Ortiz, Apple
Learn about the CNCF open source projects that allow users to run cloud native workloads! This session will cover: 1) Overview of the TAG-Runtime, how to join, and how to get involved. 2) Update of working groups (new, existing, and potential) within the scope of the TAG . 3) How the TAG provides advise to the CNCF TOC. 4) Future trends for cloud native runtime technologies in the TAG scope such as containers, Virtual Machines, Edge/MLOps and WebAssembly.
- 11 participants
- 33 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Choosing Cloud Native Technologies for the Journey to Multi-cloud - Adelina Simion, Form3
Building, deploying and maintaining systems has become increasingly more complicated in recent years. Now, as engineers look toward migrating to multi-cloud architectures, systems and processes may need to be migrated to new technologies. But what choices are available, how do they fit together and how can the CNCF landscape help? This talk discusses the cloud native technologies that can be used to convert to a multi-cloud architecture and highlights some of the lessons learned from taking this journey on at Form3. The audience will learn: - How to decide if multi-cloud is essential for them - The fundamentals of deploying services across multiple clouds with Kubernetes - How to leverage Cilium to mesh together multiple clusters - The basics of event sourcing using NATS in the multi-cloud world - Resilient and performant data storage using CockroachDB This talk is useful for any new comers to the cloud native landscape, as well as those curious about going multi-cloud!
Choosing Cloud Native Technologies for the Journey to Multi-cloud - Adelina Simion, Form3
Building, deploying and maintaining systems has become increasingly more complicated in recent years. Now, as engineers look toward migrating to multi-cloud architectures, systems and processes may need to be migrated to new technologies. But what choices are available, how do they fit together and how can the CNCF landscape help? This talk discusses the cloud native technologies that can be used to convert to a multi-cloud architecture and highlights some of the lessons learned from taking this journey on at Form3. The audience will learn: - How to decide if multi-cloud is essential for them - The fundamentals of deploying services across multiple clouds with Kubernetes - How to leverage Cilium to mesh together multiple clusters - The basics of event sourcing using NATS in the multi-cloud world - Resilient and performant data storage using CockroachDB This talk is useful for any new comers to the cloud native landscape, as well as those curious about going multi-cloud!
- 4 participants
- 27 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Distributing PromQL for Fast and Efficient Kubernetes Fleet Monitoring - Moad Zardab, Red Hat & Filip Petkovski, Shopify
Both Thanos and Cortex have enabled the cloud native ecosystem to scale Prometheus storage with the use of blocks of data persisted across many clusters into single object storage. Whilst this unlocks cheap long term retention of metrics, it presents a significant challenge of being able to efficiently read and process large volumes of data. This talk outlines the Thanos community's efforts to improve read path performance through query pushdown and query sharding and how it compares with existing Cortex approaches. Thanos deployment's are composed of stores; components that expose a consistent Prometheus compliant read API for retrieving timeseries, and queriers; components that combine raw timeseries and evaluate PromQL expressions against them. Query pushdown gives the opportunity to pre-evaluate these expressions closer to the data, while query sharding breaks down a query into distinct, disassociated datesets that can be computed concurrently thanks to Kubernetes.
Distributing PromQL for Fast and Efficient Kubernetes Fleet Monitoring - Moad Zardab, Red Hat & Filip Petkovski, Shopify
Both Thanos and Cortex have enabled the cloud native ecosystem to scale Prometheus storage with the use of blocks of data persisted across many clusters into single object storage. Whilst this unlocks cheap long term retention of metrics, it presents a significant challenge of being able to efficiently read and process large volumes of data. This talk outlines the Thanos community's efforts to improve read path performance through query pushdown and query sharding and how it compares with existing Cortex approaches. Thanos deployment's are composed of stores; components that expose a consistent Prometheus compliant read API for retrieving timeseries, and queriers; components that combine raw timeseries and evaluate PromQL expressions against them. Query pushdown gives the opportunity to pre-evaluate these expressions closer to the data, while query sharding breaks down a query into distinct, disassociated datesets that can be computed concurrently thanks to Kubernetes.
- 5 participants
- 30 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
From Student to SRE That Loves CNCF in No Time - Jacob Valdemar Andreasen, Lunar
A year ago Jacob knew nothing about Kubernetes, Linkerd, or any other fancy CNCF projects. Jacob had been studying Software Technology for two years where he learned to write code and use software design patterns. In his fifth semester, Jacob decided to try something new and joined Lunar as a Site Reliability Engineer for a five-month full-time internship where he contributed to CNCF projects and learned to develop and maintain a GitOps based Kubernetes platform. By constantly questioning his knowledge and pushing his boundaries Jacob steadily learned how to navigate the CNCF environment. Now, 1 year and 6 months later, Jacob is a Certified Kubernetes Administrator and continues to work at Lunar where he plays with Kubernetes, Flux, Fluent Bit, Prometheus, Backstage, Linkerd, and many other exciting CNCF projects. By following Jacob’s journey we will explore the opportunities and obstacles you face as a student wanting to start their journey towards working as a platform engineer.
From Student to SRE That Loves CNCF in No Time - Jacob Valdemar Andreasen, Lunar
A year ago Jacob knew nothing about Kubernetes, Linkerd, or any other fancy CNCF projects. Jacob had been studying Software Technology for two years where he learned to write code and use software design patterns. In his fifth semester, Jacob decided to try something new and joined Lunar as a Site Reliability Engineer for a five-month full-time internship where he contributed to CNCF projects and learned to develop and maintain a GitOps based Kubernetes platform. By constantly questioning his knowledge and pushing his boundaries Jacob steadily learned how to navigate the CNCF environment. Now, 1 year and 6 months later, Jacob is a Certified Kubernetes Administrator and continues to work at Lunar where he plays with Kubernetes, Flux, Fluent Bit, Prometheus, Backstage, Linkerd, and many other exciting CNCF projects. By following Jacob’s journey we will explore the opportunities and obstacles you face as a student wanting to start their journey towards working as a platform engineer.
- 2 participants
- 24 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
How to Migrate 700 Kubernetes Clusters to Cluster API with Zero Downtime - Tobias Giese & Sean Schneeweiss, Mercedes-Benz Tech Innovation
Cluster API promises "to simplify provisioning, upgrading, and operating multiple Kubernetes clusters." Do you find it challenging to migrate your existing Kubernetes cluster provisioning to Cluster API? Would you like to benefit from all the features that Cluster API offers and manage your infrastructure the Kubernetes style? At Mercedes-Benz, we run and operate more than 700 Kubernetes clusters and 3,500 machines all over the world in on-premises OpenStack data centers. By migrating to Cluster API, we replaced our legacy provisioning, consisting of Terraform, custom self-written tools and Kubernetes operators. Expect valuable insights on what it takes to transfer production systems into the control of Cluster API with zero downtime and zero customer impact. Get to know the technical challenges of migrating, how they can be solved and how to extend Cluster API functionality to fit your needs.
How to Migrate 700 Kubernetes Clusters to Cluster API with Zero Downtime - Tobias Giese & Sean Schneeweiss, Mercedes-Benz Tech Innovation
Cluster API promises "to simplify provisioning, upgrading, and operating multiple Kubernetes clusters." Do you find it challenging to migrate your existing Kubernetes cluster provisioning to Cluster API? Would you like to benefit from all the features that Cluster API offers and manage your infrastructure the Kubernetes style? At Mercedes-Benz, we run and operate more than 700 Kubernetes clusters and 3,500 machines all over the world in on-premises OpenStack data centers. By migrating to Cluster API, we replaced our legacy provisioning, consisting of Terraform, custom self-written tools and Kubernetes operators. Expect valuable insights on what it takes to transfer production systems into the control of Cluster API with zero downtime and zero customer impact. Get to know the technical challenges of migrating, how they can be solved and how to extend Cluster API functionality to fit your needs.
- 8 participants
- 31 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Jet Energy Corrections with GNN Regression using Kubeflow at CERN - Daniel Holmberg & Dejan Golubovic, CERN
The Large Hadron Collider is the world’s largest particle accelerator measuring 27 km in circumference. It accelerates beams of particles in opposite directions almost to the speed of light before making them collide. The particles emerging from the collisions are then measured in large detectors such as the Compact Muon Solenoid. An especially important object of study are so-called jets composed of multiple particles shooting out in the same direction from the collision point. Data-driven methods are used to correct the energy values for these jets, and what we’ll present here is the utilization of Kubeflow to enable state-of-the-art graph neural network based corrections. Kubeflow’s pipeline component allows us to define our machine learning workflow in a well-structured and reproducible manner, and its built-in training operators are used to scale up the training with ease. This work is expected to pave the way for future adoption of Kubeflow among the physics community at CERN.
Jet Energy Corrections with GNN Regression using Kubeflow at CERN - Daniel Holmberg & Dejan Golubovic, CERN
The Large Hadron Collider is the world’s largest particle accelerator measuring 27 km in circumference. It accelerates beams of particles in opposite directions almost to the speed of light before making them collide. The particles emerging from the collisions are then measured in large detectors such as the Compact Muon Solenoid. An especially important object of study are so-called jets composed of multiple particles shooting out in the same direction from the collision point. Data-driven methods are used to correct the energy values for these jets, and what we’ll present here is the utilization of Kubeflow to enable state-of-the-art graph neural network based corrections. Kubeflow’s pipeline component allows us to define our machine learning workflow in a well-structured and reproducible manner, and its built-in training operators are used to scale up the training with ease. This work is expected to pave the way for future adoption of Kubeflow among the physics community at CERN.
- 2 participants
- 24 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Keep Calm and Containerd On! - Anusha Ragunathan, Intuit Inc
Letting go isn't easy! Especially when it comes to your Kubernetes cluster’s CRI implementation. Like most big Kubernetes deployments, Intuit’s 200+ clusters with 20000 nodes were running ‘dockerd’ as the CRI runtime, with dependencies on the docker API and CLI. We migrated our fleet of clusters to ‘containerd’. Whether you have a complicated Kubernetes installation with customized cluster addons or a simple set of clusters, you will be affected by the upcoming removal of dockerd from upstream Kubernetes. Come listen to us, learn from our journey and be prepared to make this migration smooth and seamless. We will share lessons learned migrating clusters to containerd. From issues faced with log management, SELinux and GPU support, to rewiring cluster addons related to CNI and runtime security, this talk is about Intuit’s journey moving to containerd. We will also talk about rollout of containerd to our production clusters and how we handled compatibility issues during cluster upgrades.
Keep Calm and Containerd On! - Anusha Ragunathan, Intuit Inc
Letting go isn't easy! Especially when it comes to your Kubernetes cluster’s CRI implementation. Like most big Kubernetes deployments, Intuit’s 200+ clusters with 20000 nodes were running ‘dockerd’ as the CRI runtime, with dependencies on the docker API and CLI. We migrated our fleet of clusters to ‘containerd’. Whether you have a complicated Kubernetes installation with customized cluster addons or a simple set of clusters, you will be affected by the upcoming removal of dockerd from upstream Kubernetes. Come listen to us, learn from our journey and be prepared to make this migration smooth and seamless. We will share lessons learned migrating clusters to containerd. From issues faced with log management, SELinux and GPU support, to rewiring cluster addons related to CNI and runtime security, this talk is about Intuit’s journey moving to containerd. We will also talk about rollout of containerd to our production clusters and how we handled compatibility issues during cluster upgrades.
- 3 participants
- 27 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Keynote: 7 Years of Running Kubernetes for Mercedes-Benz - Jens Erat, DevOps Engineer; Peter Mueller, Lead Expert; Sabine Wolz, Product Owner, Mercedes-Benz Tech Innovation
Years ago, software engineers faced hard times at Mercedes-Benz: spreadsheet operations, manual processes, grown infrastructure and strict governance. A grassroots initiative of engineers accepted the challenge to change the game – and their silver bullet was Kubernetes. Join us on our journey from introducing Kubernetes 0.9 on managed servers to an on-premises self-service cloud platform with close to 1000 clusters on Cluster API. You will learn about our stake transforming a data center with a young team that mostly did not know enterprise processes before. We describe how mixing naive visions and a strong believe in open source with lots of resilience made the project a success.
Keynote: 7 Years of Running Kubernetes for Mercedes-Benz - Jens Erat, DevOps Engineer; Peter Mueller, Lead Expert; Sabine Wolz, Product Owner, Mercedes-Benz Tech Innovation
Years ago, software engineers faced hard times at Mercedes-Benz: spreadsheet operations, manual processes, grown infrastructure and strict governance. A grassroots initiative of engineers accepted the challenge to change the game – and their silver bullet was Kubernetes. Join us on our journey from introducing Kubernetes 0.9 on managed servers to an on-premises self-service cloud platform with close to 1000 clusters on Cluster API. You will learn about our stake transforming a data center with a young team that mostly did not know enterprise processes before. We describe how mixing naive visions and a strong believe in open source with lots of resilience made the project a success.
- 3 participants
- 16 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Keynote: Building Bridges: Cloud Native and High Performance Computing - Ricardo Rocha, Computing Engineer, CERN
Kubernetes and Cloud Native have taken over the modern IT deployments, but challenges remain in areas where its impact can still be much larger. The world of High Performance Computing (HPC), often air-gapped, tightly secured and with strong requirements regarding resource sharing for compute intensive workloads has yet to see its needs fully accomplished. This session will cover recent work where the last bits of the bridge are being finished.
Keynote: Building Bridges: Cloud Native and High Performance Computing - Ricardo Rocha, Computing Engineer, CERN
Kubernetes and Cloud Native have taken over the modern IT deployments, but challenges remain in areas where its impact can still be much larger. The world of High Performance Computing (HPC), often air-gapped, tightly secured and with strong requirements regarding resource sharing for compute intensive workloads has yet to see its needs fully accomplished. This session will cover recent work where the last bits of the bridge are being finished.
- 1 participant
- 19 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Keynote: Landscape Sustainability: The Pillars of Cloud Native Growth - Dave Zolotusky, Software Engineer, Spotify & Katie Gamanji, Senior Kubernetes Field Engineer, Apple
The CNCF ecosystem provides a vendor-neutral space for contributors and adopters to share their technical advancements, and cross-check innovation strategies while seeking guidance on emerging technologies. The TOC members steer the evolution of the CNCF landscape, prompting the growth of the adopter base and the number of use cases where cloud native technology can be applied. Organic adoption and development of new tools created the ecosystem and community as we know it today. However, to continue our growth, we need to identify the sustainability pillars of our community.
This keynote will feature updates on TOC strategy and core pillars that define the future of the CNCF landscape in a scalable and sustainable manner. It will highlight the latest impactful projects and initiatives that drive the ubiquity of the cloud native.
Keynote: Landscape Sustainability: The Pillars of Cloud Native Growth - Dave Zolotusky, Software Engineer, Spotify & Katie Gamanji, Senior Kubernetes Field Engineer, Apple
The CNCF ecosystem provides a vendor-neutral space for contributors and adopters to share their technical advancements, and cross-check innovation strategies while seeking guidance on emerging technologies. The TOC members steer the evolution of the CNCF landscape, prompting the growth of the adopter base and the number of use cases where cloud native technology can be applied. Organic adoption and development of new tools created the ecosystem and community as we know it today. However, to continue our growth, we need to identify the sustainability pillars of our community.
This keynote will feature updates on TOC strategy and core pillars that define the future of the CNCF landscape in a scalable and sustainable manner. It will highlight the latest impactful projects and initiatives that drive the ubiquity of the cloud native.
- 2 participants
- 15 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Keynote: Push It to the Limit: From Canary Deployments to Canary Clusters - Henrik Høegh, Platform Engineer, Lunar
At Lunar bank we had a good problem, our customers rely on us to move quickly and provide new features and to do so in a highly reliable manner. To meet their needs we set out on a journey to move from canary deployments, where we could test new features in a safe fashion, to canary clusters. We envisioned a world where our production clusters were truly disposable and after 3 years we finally achieved that goal. In this session we will share how we did it, and how you can too. Today any engineer at Lunar bank can fail over the entire platform in 40 minutes. By deeply integrating with our infrastructure provider, writing some new custom operators, and moving most state out of the cluster Lunar is in a position to make disaster recovery a day to day operation. Listen as Henrik shares the successes, key learnings, and challenges we faced along the way.
Keynote: Push It to the Limit: From Canary Deployments to Canary Clusters - Henrik Høegh, Platform Engineer, Lunar
At Lunar bank we had a good problem, our customers rely on us to move quickly and provide new features and to do so in a highly reliable manner. To meet their needs we set out on a journey to move from canary deployments, where we could test new features in a safe fashion, to canary clusters. We envisioned a world where our production clusters were truly disposable and after 3 years we finally achieved that goal. In this session we will share how we did it, and how you can too. Today any engineer at Lunar bank can fail over the entire platform in 40 minutes. By deeply integrating with our infrastructure provider, writing some new custom operators, and moving most state out of the cluster Lunar is in a position to make disaster recovery a day to day operation. Listen as Henrik shares the successes, key learnings, and challenges we faced along the way.
- 1 participant
- 15 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Keynote: Securing Shopify's Software Supply Chain - Shane Lawrence, Staff Infrastructure Security Engineer, Shopify
Recent compromises of Codecov and Solar Winds have put a spotlight on software supply chain attacks, but this focus has led to new innovations for solving an old problem. In this talk, we'll discuss lessons that Shopify has learned in protecting millions of businesses and demonstrate these techniques using open source software. We'll look at how traditional defensive techniques can be applied in the cloud, how voucher and grafeas implementations can give you control over the software that runs in your clusters, and how the SLSA framework can guide you toward establishing trust in your software. We'll also look at how Falco can be used to detect malicious behaviour or indicators that your supply chain has been compromised. Attendees can expect to learn how to apply specific techniques for mitigating supply chain attacks.
Keynote: Securing Shopify's Software Supply Chain - Shane Lawrence, Staff Infrastructure Security Engineer, Shopify
Recent compromises of Codecov and Solar Winds have put a spotlight on software supply chain attacks, but this focus has led to new innovations for solving an old problem. In this talk, we'll discuss lessons that Shopify has learned in protecting millions of businesses and demonstrate these techniques using open source software. We'll look at how traditional defensive techniques can be applied in the cloud, how voucher and grafeas implementations can give you control over the software that runs in your clusters, and how the SLSA framework can guide you toward establishing trust in your software. We'll also look at how Falco can be used to detect malicious behaviour or indicators that your supply chain has been compromised. Attendees can expect to learn how to apply specific techniques for mitigating supply chain attacks.
- 1 participant
- 16 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Keynote: The Cloud Native Chasm - Emily Fox, Security Engineer, Apple
They jumped, they leapt, they soared - and graduated. Cloud native projects each have their own path for crossing the chasm. Some projects front load - they spend a lot of time going over every detail, building governance and technical strategy while others run and jump, catching their breath on the other side to regain composure, build endurance, and continue moving forward. But for some projects, they’re still on the precipice, staring wistfully at the other edge, yearning to get there. There is no road to drive down, and no bridge to cross. In this talk, we’ll explore what it means to build that bridge, add side rails, and enable other projects continue improving that bridge, bringing everyone forward together.
Keynote: The Cloud Native Chasm - Emily Fox, Security Engineer, Apple
They jumped, they leapt, they soared - and graduated. Cloud native projects each have their own path for crossing the chasm. Some projects front load - they spend a lot of time going over every detail, building governance and technical strategy while others run and jump, catching their breath on the other side to regain composure, build endurance, and continue moving forward. But for some projects, they’re still on the precipice, staring wistfully at the other edge, yearning to get there. There is no road to drive down, and no bridge to cross. In this talk, we’ll explore what it means to build that bridge, add side rails, and enable other projects continue improving that bridge, bringing everyone forward together.
- 1 participant
- 11 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Linkerd End User Panel: Case Studies from Production - Catherine Paganini, Buoyant; Kasper Nissen, Lunar; Fredrik Klingenberg, Aurum AS; Eli Goldberg, Salt Security; Christian Hüning, Finleap Connect
Paganini, Eli Goldberg
In this panel, you'll hear from end users across a variety of industries on how they use the Linkerd service mesh in real-world production scenarios. Use cases range from applying mutual TLS to encrypt and secure all service-to-service communication, load balancing gRPC requests, and troubleshooting services before they're pushed to production. Panelists represent a variety of companies with very different environments, goals, and priorities, and discussion will be focused on real-world outcomes.
Linkerd End User Panel: Case Studies from Production - Catherine Paganini, Buoyant; Kasper Nissen, Lunar; Fredrik Klingenberg, Aurum AS; Eli Goldberg, Salt Security; Christian Hüning, Finleap Connect
Paganini, Eli Goldberg
In this panel, you'll hear from end users across a variety of industries on how they use the Linkerd service mesh in real-world production scenarios. Use cases range from applying mutual TLS to encrypt and secure all service-to-service communication, load balancing gRPC requests, and troubleshooting services before they're pushed to production. Panelists represent a variety of companies with very different environments, goals, and priorities, and discussion will be focused on real-world outcomes.
- 7 participants
- 37 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Production-scale Containerized Game Platform Practice in Bytedance - Chenyu Jiang, ByteDance, Inc & Viktor Farcic, Upbound
Classical games servers are running on physical machines or virtual machines to provide services to players. However, packaging game servers as in containers is quickly gaining traction across the tech landscape because of container's isolated runtime paradigm, cost efficiency and elasticity. In Bytedance, games is one of the major vertical domains and we need a mature games-centric platform to serve games from both in-house games studios and agents of game manufacturers globally. In this talk, a Bytedance's practice will be shared in establishing a Kubernetes based Game platform. It leverages multiple CNCF open source frameworks: Crossplane, KubeVela, Agones to address challenges and requirements for games to go cloud-native, such as game servers and dependency resource deployment in multi-cloud/multi-region, game servers orchestration and stateful games service autoscaling.
Production-scale Containerized Game Platform Practice in Bytedance - Chenyu Jiang, ByteDance, Inc & Viktor Farcic, Upbound
Classical games servers are running on physical machines or virtual machines to provide services to players. However, packaging game servers as in containers is quickly gaining traction across the tech landscape because of container's isolated runtime paradigm, cost efficiency and elasticity. In Bytedance, games is one of the major vertical domains and we need a mature games-centric platform to serve games from both in-house games studios and agents of game manufacturers globally. In this talk, a Bytedance's practice will be shared in establishing a Kubernetes based Game platform. It leverages multiple CNCF open source frameworks: Crossplane, KubeVela, Agones to address challenges and requirements for games to go cloud-native, such as game servers and dependency resource deployment in multi-cloud/multi-region, game servers orchestration and stateful games service autoscaling.
- 2 participants
- 30 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Rook: Intro and Deep Dive with Ceph Storage - Travis Nielsen, Sebastien Han & Blaine Gardner, Red Hat; Satoru Takeuchi, Cybozu, Inc.
The Rook project will be introduced to attendees of all levels and experience. Rook is an open source cloud-native storage operator for Kubernetes, providing the platform, framework, and support for Ceph to natively integrate with Kubernetes. A deep-dive will be presented for the Ceph storage provider to show how Rook provides stable block, shared file system, and object storage for your production data. The recent features in the v1.9 release will be covered to further illustrate how Rook continues to be enhanced for production environments. Rook was accepted as a graduated project by the Cloud Native Computing Foundation in October 2020.
Rook: Intro and Deep Dive with Ceph Storage - Travis Nielsen, Sebastien Han & Blaine Gardner, Red Hat; Satoru Takeuchi, Cybozu, Inc.
The Rook project will be introduced to attendees of all levels and experience. Rook is an open source cloud-native storage operator for Kubernetes, providing the platform, framework, and support for Ceph to natively integrate with Kubernetes. A deep-dive will be presented for the Ceph storage provider to show how Rook provides stable block, shared file system, and object storage for your production data. The recent features in the v1.9 release will be covered to further illustrate how Rook continues to be enhanced for production environments. Rook was accepted as a graduated project by the Cloud Native Computing Foundation in October 2020.
- 2 participants
- 26 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
SIG-Multicluster Intro and Deep Dive - Jeremy Olmsted-Thompson & Laura Lorenz, Google; Paul Morie, Apple
SIG-Multicluster is focused on solving common challenges related to the management of many Kubernetes clusters, across multiple cloud providers (so-called hybrid cloud), and applications deployed across many clusters. In this session, we'll give attendees an overview of the current status of the multi-cluster problem space in Kubernetes and of the SIG. We’ll discuss current thinking around best practices for multi-cluster deployments and what it means to be part of a ClusterSet. Then we’ll highlight current SIG projects, focused use cases, and ideas for what’s next. Most importantly, we’ll provide information on how you can get involved either as a contributor or as a user who wants to provide feedback about the SIG's current efforts and future direction. Bring your questions, problems, and ideas - help us expand the multi-cluster Kubernetes landscape.
SIG-Multicluster Intro and Deep Dive - Jeremy Olmsted-Thompson & Laura Lorenz, Google; Paul Morie, Apple
SIG-Multicluster is focused on solving common challenges related to the management of many Kubernetes clusters, across multiple cloud providers (so-called hybrid cloud), and applications deployed across many clusters. In this session, we'll give attendees an overview of the current status of the multi-cluster problem space in Kubernetes and of the SIG. We’ll discuss current thinking around best practices for multi-cluster deployments and what it means to be part of a ClusterSet. Then we’ll highlight current SIG projects, focused use cases, and ideas for what’s next. Most importantly, we’ll provide information on how you can get involved either as a contributor or as a user who wants to provide feedback about the SIG's current efforts and future direction. Bring your questions, problems, and ideas - help us expand the multi-cluster Kubernetes landscape.
- 3 participants
- 29 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Spark on Kubernetes: The Elastic Story - Bowen Li & Huichao Zhao, Apple
Apache Spark is a unified analytics engine for large-scale data processing. People are moving Spark and batch workload to Kubernetes due to its uprising popularity. There are many challenges to running Spark efficiently on Kubernetes, for example, supporting autoscaling-based workloads. In this talk, we discuss building a large scale Spark Service on top of Kubernetes. We will also walk through autoscaling on a multi-tenant platform with advanced features such as physical isolation, min/max capacity setting, bin-packing, scale-in and scale out controls, and more. These improvements show significant CPU and memory utilization savings for Spark on Kubernetes.
Spark on Kubernetes: The Elastic Story - Bowen Li & Huichao Zhao, Apple
Apache Spark is a unified analytics engine for large-scale data processing. People are moving Spark and batch workload to Kubernetes due to its uprising popularity. There are many challenges to running Spark efficiently on Kubernetes, for example, supporting autoscaling-based workloads. In this talk, we discuss building a large scale Spark Service on top of Kubernetes. We will also walk through autoscaling on a multi-tenant platform with advanced features such as physical isolation, min/max capacity setting, bin-packing, scale-in and scale out controls, and more. These improvements show significant CPU and memory utilization savings for Spark on Kubernetes.
- 2 participants
- 21 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
TikTok’s Story: How To Manage a Thousand Applications on Edge With Argo CD - Qingkun Li, TikTok/Bytedance Inc. & Jesse Suen, Akuity, Inc.
This talk will share a case study of how TikTok manages its global edge clusters with Kubernetes and operates continuous delivery with Argo CD concluding with a demo. The talk will also dive into the scalability challenges faced by Tiktok to manage edge services using Argo CD (with ~100 edge clusters, ~150k CPUs and ~3000 applications), as well as how the Argo community plans to address them in future. TikTok operates a large network of Kubernetes edge clusters around the world, hosting apps such as Tiktok, live and gaming, using cache and traffic acceleration services offered at our edge clusters. The challenge arises when it comes to the deployment management of those edge services on hundreds of edge clusters. Normally, an edge service shares a lot of common configurations when deployed globally, but still has cluster-specific configurations (e.g. resource quota, service hostname, etc.). From this talk, people will learn how to deploy and manage such kind of services using Argo CD.
TikTok’s Story: How To Manage a Thousand Applications on Edge With Argo CD - Qingkun Li, TikTok/Bytedance Inc. & Jesse Suen, Akuity, Inc.
This talk will share a case study of how TikTok manages its global edge clusters with Kubernetes and operates continuous delivery with Argo CD concluding with a demo. The talk will also dive into the scalability challenges faced by Tiktok to manage edge services using Argo CD (with ~100 edge clusters, ~150k CPUs and ~3000 applications), as well as how the Argo community plans to address them in future. TikTok operates a large network of Kubernetes edge clusters around the world, hosting apps such as Tiktok, live and gaming, using cache and traffic acceleration services offered at our edge clusters. The challenge arises when it comes to the deployment management of those edge services on hundreds of edge clusters. Normally, an edge service shares a lot of common configurations when deployed globally, but still has cluster-specific configurations (e.g. resource quota, service hostname, etc.). From this talk, people will learn how to deploy and manage such kind of services using Argo CD.
- 2 participants
- 26 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
What Is the CNCF TAG Observability and How You Can Join Our Effort! - Bartłomiej Płotka, Red Hat; Alolita Sharma, Amazon; Richard Hartmann, Grafana Labs; Matthew Young, Everquote
The Technical Advisory Groups (TAGs, formerly SIGs) were created by the CNCF to help provide technical guidance and expertise across projects pertaining to a specific domain. One of the domains that are continuously demanded and show great innovation within the cloud-native ecosystem is Observability. In this talk, the TAG Observability co-chairs and team lead will introduce the activities we do with TAG members and the community. You will learn what has been done so far and how you can join us in our efforts to improve the observability ecosystem around us. We are looking for a diversity of skills and backgrounds, so we can give out more supporting materials and best practices for end-users and contributors around monitoring and observability. The audience will learn what TAG Observability is responsible for, how to contribute and where to find us. Hopefully, this talk will inspire your to join our Observability community!
What Is the CNCF TAG Observability and How You Can Join Our Effort! - Bartłomiej Płotka, Red Hat; Alolita Sharma, Amazon; Richard Hartmann, Grafana Labs; Matthew Young, Everquote
The Technical Advisory Groups (TAGs, formerly SIGs) were created by the CNCF to help provide technical guidance and expertise across projects pertaining to a specific domain. One of the domains that are continuously demanded and show great innovation within the cloud-native ecosystem is Observability. In this talk, the TAG Observability co-chairs and team lead will introduce the activities we do with TAG members and the community. You will learn what has been done so far and how you can join us in our efforts to improve the observability ecosystem around us. We are looking for a diversity of skills and backgrounds, so we can give out more supporting materials and best practices for end-users and contributors around monitoring and observability. The audience will learn what TAG Observability is responsible for, how to contribute and where to find us. Hopefully, this talk will inspire your to join our Observability community!
- 8 participants
- 30 minutes
1 Jun 2022
Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.
Your Manila CephFS Share Backups Belong to S3 - Robert Vasek, CERN
Backups. Boring and mundane, until you lose your application data and need it back. Our Kubernetes users at CERN make extensive use of CephFS-backed storage managed by the OpenStack Manila service. Streamlining and automating the process of backups gives them a chance to prepare so that–should a disaster strike–they can recover. In this talk we will delve into the work we have done to make Velero, CephFS, Manila and an S3 store cooperate together, and bring an application back into life. Expect code snippets and demos. By the end of this session, you should have a clear overview of how each component contributes to our current backup and restore workflow in Kubernetes, and how you can integrate this setup into your clusters too.
Your Manila CephFS Share Backups Belong to S3 - Robert Vasek, CERN
Backups. Boring and mundane, until you lose your application data and need it back. Our Kubernetes users at CERN make extensive use of CephFS-backed storage managed by the OpenStack Manila service. Streamlining and automating the process of backups gives them a chance to prepare so that–should a disaster strike–they can recover. In this talk we will delve into the work we have done to make Velero, CephFS, Manila and an S3 store cooperate together, and bring an application back into life. Expect code snippets and demos. By the end of this session, you should have a clear overview of how each component contributes to our current backup and restore workflow in Kubernetes, and how you can integrate this setup into your clusters too.
- 4 participants
- 36 minutes