youtube image
From YouTube: WebAssembly Based AI as a Service on the Edge with Kubernetes - Rishit Dagli & Shivay Lamba

Description

Don’t miss out! Join us at our upcoming event: KubeCon + CloudNativeCon Europe 2023 in Amsterdam, The Netherlands from April 17-21. Learn more at https://kubecon.io​. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.

WebAssembly Based AI as a Service on the Edge with Kubernetes - Rishit Dagli, Narayana Junior College; Incoming University of Toronto & Shivay Lamba, Meilisearch

WebAssembly (WASM) is being adopted at an increasing rate for edge applications. That allows WASM runtimes, such as WasmEdge (a lightweight and high-performance runtime for cloud-native, edge, and decentralized devices), to run serverless functions on the edge. Following the large-scale adoption and benefits of serverless computing, we focus on deploying these as a Function-as-a-service on edge devices. Machine Learning inference is often a computationally intensive task and edge applications could greatly benefit from the speed of WebAssembly. Unfortunately, Linux containers end up being too heavy for such tasks. Demonstrating Machine Learning deployments in such a fashion, another problem we face is that the standard WebAssembly provides very limited access to the native OS and hardware, such as multi-core CPUs, GPUs, or TPUs which is not ideal for the systems we target. The talk also shows how one could use the WebAssembly System Interface (WASI) to get security, portability, and native speed for ML models. To top it off this talk ends with a demo of deploying a Machine learning model as a serverless function using WASM deployed on an edge device.