KServe Presentations and Demos
This page contains a collection of presentations, demos, and talks about KServe from various conferences, meetups, and community events. If you'd like to add a presentation or demo here, please send a pull request to the KServe website repository.
Conference Talks by Year
2025
| Event | Title | Speaker(s) | Resources |
|---|---|---|---|
| Cloud Native & Kubernetes AI Day North America | KServe Next: Advancing Generative AI Model Serving | Yuan Tang, Dan Sun | Info |
| KubeCon North America | Anchoring Trust in the Age of AI: Identities Across Humans, Machines, and Models | Yuan Tang, Anjali Telang | Info |
| KubeCon | Navigating the Rapid Evolution of Large Model Inference: Where does Kubernetes Fit? | Jiaxin Shan, Yuan Tang, Sergey Kanzhelev, Rita Zhang | Info |
| Cloud Native & Kubernetes AI Day Europe | Panel: Engaging the Kubeflow Community: Building an Enterprise-Ready AI/ML Platform | Yuan Tang, Andrey Velichkevich, Andreea Munteanu, Johnu George, Ronen Dar | Video |
| KubeCon Europe | Advancements in AI/ML Inference Workloads on Kubernetes From WG Serving and Ecosystem Projects | Yuan Tang, Eduardo Arango Gutierez | Video |
| KubeCon Europe | Kubeflow Ecosystem: What's Next for Cloud Native AI/ML and LLMOps | Johnu George, Andrey Velichkevich, Yuki Iwai, Yuan Tang, Valentina Rodriguez | Video |
| Red Hat Summit | Embracing Partnership and Open Collaboration in the Cloud-Native and AI Model Serving Communities | Yuan Tang, Adam Tetelman, Brittany Rockwell, Tyler Michael Smith | Info |
| IBM TechXchange | KServe Deep Dive: Evolving Model Serving for the Generative AI Era | Yuan Tang | Info |
2024
| Event | Title | Speaker(s) | Resources |
|---|---|---|---|
| KubeCon North America | Unlocking Potential of Large Models in Production | Yuan Tang | Video |
| Cloud Native & Kubernetes AI Day North America | Advancing Cloud Native AI Innovation Through Open Collaboration | Yuan Tang | Video |
| Kubernetes Podcast from Google | Kubernetes Working Group Serving | Yuan Tang, Eduardo Arango Gutierez | Podcast |
| IBM TechXchange | KServe Essentials: Building a Production-Ready Cloud-Native Model Serving Platform | Yuan Tang | Info |
| PlatformCon | Engineering Cloud Native AI Platform | Yuan Tang | Video |
| Open Data Science Conference | Highly Scalable Inference Platform for Models of Any Size | Yuan Tang | Info |
| KubeCon Europe | Production-Ready AI Platform on Kubernetes | Yuan Tang | Video |
| KubeCon North America | Engaging the KServe Community | Adam Tetelman, Taneem Ibrahim, Johnu George, Tessa Pham, Andreea Munteanu | Video |
| KubeCon Europe | Fortifying AI Security in Kubernetes with Confidential Containers | Suraj Deshmukh, Pradipta Banerjee | Video |
| KubeCon Europe | From Bash Scripts to Kubeflow and GitOps: Our Journey to Operationalizing ML at Scale | Luca Grazioli, Dennis Ohrndorf | Video |
| KubeCon North America | Production AI at Scale: Cloudera's Journey Adopting KServe | Zoram Thanga, Peter Ableda | Video |
| KubeCon North America | Optimizing Load Balancing and Autoscaling for LLM Inference on Kubernetes | David Gray | Video |
| KubeCon North America | Best Practices for Deploying LLM Inference, RAG and Fine Tuning Pipelines | Meenakshi Kaushik, Shiva Krishna Merla | Video |
| KubeCon Europe | Platform Building Blocks: How to Build ML Infrastructure with CNCF Projects | Yuzhui Liu, Leon Zhou | Info |
2023
| Event | Title | Speaker(s) | Resources |
|---|---|---|---|
| KubeCon Europe | The State and Future of Cloud Native Model Serving | Dan Sun, Theofilos Papapanagiotou | Video |
| Kubeflow Summit | Scale your Models to Zero with Knative and KServe | Jooho Lee | Video |
| Kubeflow Summit | What to choose? ModelMesh vs Model Serving? | Vaibhav Jain | Video |
2022
| Event | Title | Speaker(s) | Resources |
|---|---|---|---|
| KubeCon AI Days | Exploring ML Model Serving with KServe | Alexa Nicole Griffith | Video |
| KubeCon AI Days | Enhancing the Performance Testing Process for gRPC Model Inferencing at Scale | Ted Chang, Paul Van Eck | Video |
| KubeCon Edge Days | Model Serving at the Edge Made Easier | Paul Van Eck | Video |
| KnativeCon | How We Built an ML inference Platform with Knative | Dan Sun | Video |
2021
| Event | Title | Speaker(s) | Resources |
|---|---|---|---|
| KubeCon AI Days | Serving Machine Learning Models at Scale Using KServe | Yuzhui Liu | Video |
| KubeCon | Serving Machine Learning Models at Scale Using KServe | Animesh Singh | Video |
| KubeCon China | Accelerate Federated Learning Model Deployment with KServe | Fangchi Wang, Jiahao Chen | Video |
2020
| Event | Title | Speaker(s) | Resources |
|---|---|---|---|
| NVIDIA GTC | Accelerate and Autoscale Deep Learning Inference on GPUs with KFServing | Dan Sun, David Goodwin | Video |
| ICML Workshop | Serverless inferencing on Kubernetes | Clive Cox | Video |
| Serverless Summit | Serverless Machine Learning Inference with KFServing | Clive Cox, Yuzhui Liu | Video |
| Kubeflow Dojo | KFServing - Production Model Serving Platform | Animesh Singh, Tommy Li | Video |
| Kubeflow Dojo | Demo - KFServing End to End through Notebook | Animesh Singh, Tommy Li | Video |
| Kubeflow Dojo | Demo - KFServing with Kafka and Kubeflow Pipelines | Animesh Singh | Video |
| Kubeflow Community | KFServing - Enabling Serverless Workloads Across Model Frameworks | Ellis Tarn | Video |
| Google Cloud | Kubeflow 101: What is KFServing? | Stephanie Wong | Video |
| Anchor MLOps | MLOps Coffee Sessions - Serving Models with Kubeflow | David Aponte, Demetrios Brinkmann | Podcast |
2019
| Event | Title | Speaker(s) | Resources |
|---|---|---|---|
| KubeCon | Introducing KFServing: Serverless Model Serving on Kubernetes | Dan Sun, Ellis Tarn | Video |
| KubeCon | Advanced Model Inferencing Leveraging KNative, Istio & Kubeflow Serving | Animesh Singh, Clive Cox | Video |
Books and Publications
Books
"Distributed Machine Learning Patterns"
- Author: Yuan Tang
- Publisher: Manning Publications
- Description: Covers distributed machine learning patterns including model serving with KServe
- Book Repository
Community Demos and Tutorials
Live Demos and Tutorials
"MLOps Meetup: KServe Live Coding Session"
- Speaker: Theofilos Papapanagiotou
- Features Shown: Building a KServe deployment from scratch
- Video