Use Code TRYNOW15 for a One-Time, Extra 15% OFF at KodeKloud
AI
DevOps

Learn By Doing: Build, Track, and Orchestrate AI Models with MLflow and Kubeflow

Learn to move from Jupyter notebooks to production MLOps for AIOps use cases. Build reproducible and scalable ML workflows using MLflow and Kubeflow Pipelines, deploy models on Kubernetes, and automate training-to-deployment pipelines with hands-on labs.
Nourhan Mohamed
DevOps Lead | Cloud Native Enthusiast | Golden Kubestronaut
DevOps Pre-Requisite Course
Play Button
Fill this form to get a notification when course is released.
book
4
Lessons
book
Challenges
Article icon
9
Topics

What you’ll learn

Our students work at..

Description

This course takes you from ad-hoc Jupyter notebooks to production-style MLOps workflows tailored for AIOps use cases such as anomaly detection and incident prediction. You will learn how to make models reproducible, trackable, and deployable at scale using MLflow for experiment tracking and model packaging, and Kubeflow Pipelines for end-to-end orchestration on Kubernetes. Through hands-on labs, you will convert exploratory code into parameterized scripts, track experiments with MLflow and MinIO, deploy inference services to Kubernetes, and automate train → register → validate → deploy pipelines for AIOps workloads.

Prerequisites

  1. Python Basics
  2. Kubernetes Basics
  3. Machine Learning Fundamentals

Course Highlights

  1. Why MLOps Is Critical for AIOpsThis module explains the shift from exploratory notebook-based experimentation to production-grade ML workflows for AIOps, showing how MLOps practices improve reliability, observability, and governance for models used in incident management and anomaly detection.
    • Lab 1.1 – From Notebook to Production
      • Convert an anomaly detection Jupyter notebook into a train.py script.
      • Introduce CLI arguments, random seeds, and reproducibility best practices.
      • Run multiple configurations manually to prepare for later automation.
  2. Experiment Tracking & Model Packaging with MLflowThis module introduces MLflow Tracking for capturing parameters, metrics, and artifacts across experiments, and shows how to configure MinIO as an S3-compatible artifact store integrated with an MLflow Tracking Server for reproducible AIOps workflows.
    • Lab 2.1 – Setting Up MLflow & MinIO
      • Deploy MinIO and MLflow Tracking Server.
      • Verify UI access and connectivity between MLflow and MinIO as an artifact backend.
    • Lab 2.2 – Logging Parameters, Metrics, and Artifacts
      • Instrument train.py with MLflow Tracking API calls.
      • Log parameters, metrics, and model artifacts, then explore runs in the MLflow UI.
    • Lab 2.3 – Packaging Models for Reproducibility
      • Create MLproject and conda.yaml files for reproducible runs.
      • Define entry points and re-run experiments using the MLflow CLI.
  3. Deploying & Serving AIOps ModelsThis module focuses on moving from trained models to live inference endpoints suitable for real-time anomaly detection, covering different ways to serve MLflow models and expose them via REST APIs for consumption by AIOps systems.
    • Lab 3.1 – Serving Models with MLflow
      • Serve the trained model locally from MLflow runs.
      • Register the model in the MLflow Model Registry and serve it from there.
      • Test predictions using curl and Python requests.
    • Lab 3.2 – Containerizing and Deploying to Kubernetes
      • Package the serving application into a Docker image.
      • Deploy the model-serving service to Kubernetes using Deployment and Service resources.
      • Verify access to predictions through a REST API endpoint.
  4. Orchestrating AIOps Pipelines with KubeflowThis module teaches how to automate the full ML lifecycle — from training to validation to deployment — using Kubeflow Pipelines, and how to connect Kubeflow components with MLflow to build traceable, production-style AIOps pipelines.
    • Lab 4.1 – Exploring Kubeflow Pipelines
      • Access the Kubeflow UI and inspect pre-built sample pipelines.
      • Run sample pipelines and observe execution graphs and artifacts.
    • Lab 4.2 – Building the Training & Registration Components
      • Create a train component that uses the existing train.py script.
      • Create a register component that pushes the trained model to the MLflow Model Registry.
      • Run a two-step Kubeflow pipeline to train and log a model to MLflow.
    • Lab 4.3 – Building the Full Train → Validate → Deploy Pipeline
      • Add a validate component that checks whether the model’s anomaly rate is within acceptable limits.
      • Add a deploy component that consumes the trained model and deploys it to production.
      • Compile, upload, and trigger a four-step Kubeflow pipeline to automate model training and serving end to end.
Read More

What our students say

About the instructor

Nourhan Mohamed is a DevOps Instructor and Cloud Native Enthusiast at KodeKloud, specializing in Kubernetes, Docker, CI/CD, and cloud-native technologies. As a Golden Kubestronaut, she focuses on container orchestration, automation, and troubleshooting. At KodeKloud, she designs hands-on DevOps labs that bridge theory with real-world application, empowering learners to build scalable and resilient systems.

No items found.

Why MLOps Is Critical for AIOps

lock
lock
1
Topics
Lesson Content
From Notebook to Production

Experiment Tracking & Model Packaging with MLflow

lock
lock
3
Topics
Lesson Content

Module Content

Setting Up MLflow and MinIO
Logging Parameters, Metrics and Artifacts
Packaging Models For Reproducibility

Deploying & Serving AIOps Models

lock
lock
2
Topics
Lesson Content

Module Content

Serving Models With MLflow
Containerizing and Deploying to Kubernetes

Orchestrating AIOps Pipelines with Kubeflow

lock
lock
3
Topics
Lesson Content

Module Content

Exploring Kubeflow Pipelines
Building the Training & Registration Components
Building the Full Train → Validate → Deploy Pipeline
Play Button
Fill this form to get a notification when course is released.
This course comes with hands-on cloud labs
book
4
Modules
book
Lessons
Article icon
9
Lessons
check mark
Course Certificate
Videos icon
Hours of Video
laptop
Hours of Labs
Story Format
Videos icon
Videos
Case Studies
ondemand_video icon
Demo
laptop
Labs
laptop
Cloud Labs
checklist
Mock exams
Quizzes
Discord Community Support
people icon
Community support
language icon
English
Closed Captions