Conversations
No description provided.
- 8 meetings
- latest Sep 2023
- 4 participants
- 50 minutes
- 1 per year
Featured Videos
Start here to learn more about OctoML
- 11 meetings
- latest Aug 2023
- 3 participants
- 25 minutes
- 7 per year
TVMCon 2023
TVMCon covers state of the art of deep learning compilation and optimization, with a range of tutorials, research talks, case studies, and industry presentations.
Apache TVM is an open-source deep learning compiler stack for CPUs, GPUs, and specialized accelerators. It aims to close the gap between the productivity-focused deep learning frameworks, and the performance- or efficiency-oriented hardware backends.
- 44 meetings
- latest Mar 2023
- 3 participants
- 20 minutes
- >1 per day
Free Machine Learning Compilation Course
Deploying innovative AI models in different production environments becomes a common problem as AI applications become more ubiquitous in our daily lives. Deployment of both training and inference workloads bring great challenges as we start to support a combinatorial choice of models and environment. Additionally, real world applications bring with a multitude of goals, such as minimizing dependencies, broader model coverage, leveraging the emerging hardware primitives for performance, reducing memory footprint, and scaling to larger environments.
Solving these problems for training and inference involves a combination of ML programming abstractions, learning-driven search, compilation, and optimized library runtime. These themes form an emerging topic – machine learning compilation that contains active ongoing developments. In this tutorials sequence, we offer the first comprehensive treatment of its kind to study key elements in this emerging field systematically. We will learn the key abstractions to represent machine learning programs, automatic optimization techniques, and approaches to optimize dependency, memory, and performance in end-to-end machine learning deployment.
This course aims to target audiences who are working on machine learning in the wild. ML in practice is a broad topic that involves collaborations among multiple audiences, including machine learning scientists, machine learning engineers, and hardware providers.
The course requires a minimum set of prerequisites in data science and machine learning:
- Python, familiarity with numpy.
- Some background in one deep learning framework (e.g. PyTorch, TensorFlow, JAX)
- Experiences in system programming (e.g. C/CUDA) would be beneficial but not required.
- 8 meetings
- latest Aug 2022
- 1 participant
- 50 minutes
- 3 per month
Apache TVM Community
No description provided.
- 23 meetings
- latest May 2022
- 9 participants
- 50 minutes
- 1 per month
TVMCon 2021, the Machine Learning Acceleration Conference
Recordings from the Dec 2021 virtual Apache TVM and Open Source Machine Learning Acceleration Conference.
TVMCon covers the state of the art of deep learning compilation and optimization, with a range of tutorials, research talks, case studies, and industry presentations. We discuss recent advances in ML frameworks, compilers, systems and architecture support, security, training and hardware acceleration.
https://www.tvmcon.org
- 56 meetings
- latest Jan 2022
- 3 participants
- 25 minutes
- >1 per day
TVMCon 2020
No description provided.
- 24 meetings
- latest Dec 2020
- 4 participants
- 35 minutes
- >1 per day