youtube image
From YouTube: Kubernetes Love Machine Learning, Even on Private Cloud - Hui Luo, VMware

Description

Join us for Kubernetes Forums Seoul, Sydney, Bengaluru and Delhi - learn more at kubecon.io

Don't miss KubeCon + CloudNativeCon 2020 events in Amsterdam March 30 - April 2, Shanghai July 28-30 and Boston November 17-20! Learn more at kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects

Kubernetes Love Machine Learning, Even on Private Cloud - Hui Luo, VMware

Kubernetes has established as a good platform for machine learning workloads by extending support of accelerators like GPU, all major public cloud provider are offering GPU enabled Kubernetes services, but public cloud is not the only option for users. There are ongoing efforts from the community to make running machine learning workloads with Kubernetes on private cloud as easy as on public cloud. This talk is going to cover 3 major challenges that facing private cloud when enable GPU on Kubernetes. I will also demonstrate and discuss some of the projects that help to solve those challenges: 1) Private cloud usually needs to support a wider range of GPU types, in some case, to support heterogeneous GPU in one cluster 2) To support complex hardware topology like RDMA, NVLINK 3) GPU resource contention is usually very high when limited GPU resource shared by multiple teams

For more info click here: https://sched.co/FuLx