
Training XGBoost with MLflow Experiments and HyperOpt

by Ani Madurkar



Colors of the Adirondacks. Image by author

As you evolve in your Machine Learning journey, you'll soon find yourself gravitating toward MLOps whether you like it or not. Building efficient, scalable, and resilient machine learning systems is a challenge, and (in my opinion) that is the real job of a Data Scientist, as opposed to just doing modeling.

The modeling part has been largely figured out for most use cases. Unless you're trying to be at the bleeding edge of the craft, you're likely dealing with structured, tabular datasets. The choice of model can vary depending on the dataset size, assumptions, and technical restrictions, but for the most part it is fairly repeatable. My workflow for supervised learning during the experimentation phase has converged to using XGBoost with HyperOpt and MLflow: XGBoost as the model of choice, HyperOpt for hyperparameter tuning, and MLflow for experiment tracking.

This also represents a phenomenal first step as you embark on the MLOps journey, because I think it's easiest to start doing MLOps work during the experimentation phase (model tracking, versioning, registry, etc.). This stack is lightweight and highly configurable, which makes it easy to scale up and down as needed.

Although I briefly discuss XGBoost, MLflow, and HyperOpt, this isn't a deep walkthrough of each. Some hands-on familiarity with each will be helpful for understanding how the pieces here work in more depth. I'll be working with the UCI ML Breast Cancer Wisconsin (Diagnostic) dataset (CC BY 4.0).

To begin, we can start an MLflow server (I discuss what's happening here a bit later):

mlflow server \
  --backend-store-uri sqlite:///mlflow.db \
  --default-artifact-root ./mlruns \
  --host 0.0.0.0 \
  --port 5000
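With the server up, the Python side only needs to know where to send runs. Here is a minimal sketch of that; the experiment name is my own placeholder, not something from the original code:

import mlflow

# Point the Python client at the tracking server started above
mlflow.set_tracking_uri("http://127.0.0.1:5000")

# Group all runs for this project under one experiment
# (the experiment name here is illustrative)
mlflow.set_experiment("breast-cancer-xgboost-hyperopt")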

pandas-profiling is a fantastic open-source library for running a quick exploratory data analysis report on a dataset. Descriptive statistics, null counts, anomaly detection, distribution analysis, and more are all shown in the report.

Sample view of the HTML Profiler output

I save the report as an HTML file so I can interact with the analysis in a web page instead of inside a Jupyter Notebook, which could run into memory errors depending on dataset size. See the Quickstart guide for how pandas-profiling works: https://pandas-profiling.ydata.ai/docs/master/pages/getting_started/quickstart.html
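Generating and saving the report takes only a few lines. A minimal sketch, assuming we load scikit-learn's copy of the dataset as a DataFrame (the author may load the UCI data differently):

from sklearn.datasets import load_breast_cancer
from pandas_profiling import ProfileReport

# Load the dataset as a DataFrame (scikit-learn ships a copy of the UCI data)
df = load_breast_cancer(as_frame=True).frame

profile = ProfileReport(df, title="Breast Cancer Wisconsin (Diagnostic) EDA")
profile.to_file("breast_cancer_eda.html")  # open the HTML report in a browser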

Lastly, we can create training, validation, and testing datasets.
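One way to do this, continuing from the DataFrame above (the 60/20/20 split ratios are my own assumption, not necessarily what the original code uses):

from sklearn.model_selection import train_test_split

# Features and target from the scikit-learn DataFrame
X = df.drop(columns=["target"])
y = df["target"]

# First carve off 40% of the data, then split that half-and-half
# into validation and test sets
X_train, X_tmp, y_train, y_tmp = train_test_split(
    X, y, test_size=0.4, stratify=y, random_state=42
)
X_val, X_test, y_val, y_test = train_test_split(
    X_tmp, y_tmp, test_size=0.5, stratify=y_tmp, random_state=42
)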

XGBoost (eXtreme Gradient Boosting) has become the de facto model of choice for a large number of tabular modeling tasks. It's still highly recommended to try simpler models like Linear/Logistic Regression first, but in my experience almost all structured, tabular modeling projects with more than 50–100K rows have ended with gradient-boosted trees winning out by a significant margin.

How do they work?

XGBoost is an open-source gradient-boosting algorithm that uses decision trees as weak learners to build up a stronger model; it is considered an ensemble model because it combines multiple models together. There are two common ensemble methods: Bagging and Boosting.

Bagging, or bootstrap aggregating, trains each learner on a random bootstrap sample of the data and aggregates their predictions. It typically yields low variance, though it can have higher bias, and it can lead to better training stability, stronger accuracy, and a lower tendency to overfit. Random Forest models leverage bagging by combining decision trees, where each tree can only pick from a random subset of features.

An illustration of the concept of bootstrap aggregating. Public domain, Wikipedia

Boosting, in contrast, works to convert weak learners into strong ones. Each learner, or model, is trained on the same set of samples, but each sample is weighted differently in each iteration, with misclassified samples weighted more heavily. This results in the sequence of weak learners progressively correcting each other's mistakes and improving model performance over time.

An illustration of the concept of boosting. Public domain, Wikipedia

The "gradient" part of gradient boosting refers to the fact that each new tree is fit to the gradient of the loss function, effectively performing gradient descent in function space to minimize it. The objective for XGBoost is a regularized (L1 and L2) objective function that combines a convex loss term with a model-complexity penalty.
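For reference, the regularized objective from the XGBoost paper (Chen and Guestrin, 2016) can be written as follows, where l is a convex loss, the f_k are the individual trees, T is the number of leaves in a tree, and w are its leaf weights (L1 regularization on the weights is also available in the implementation via the alpha parameter):

\mathcal{L}(\phi) = \sum_i l(\hat{y}_i, y_i) + \sum_k \Omega(f_k),
\qquad
\Omega(f) = \gamma T + \tfrac{1}{2}\lambda \lVert w \rVert^2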

XGBoost rose to fame as it became the standard for winning a multitude of Kaggle competitions, but lately people have also been using Microsoft's LightGBM, as it can be faster on large datasets. I've typically found phenomenal performance with both; which one to use can depend on your needs.
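Before any tuning, a baseline XGBoost classifier on the splits from earlier looks something like this. A minimal sketch; the hyperparameter values are illustrative, not the tuned ones:

from xgboost import XGBClassifier
from sklearn.metrics import roc_auc_score

# Baseline model with a handful of illustrative hyperparameters
baseline = XGBClassifier(
    n_estimators=200,
    max_depth=4,
    learning_rate=0.1,
    random_state=42,
)
baseline.fit(X_train, y_train)

# Score on the held-out validation set
val_auc = roc_auc_score(y_val, baseline.predict_proba(X_val)[:, 1])
print(f"Baseline validation ROC AUC: {val_auc:.3f}")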

MLflow is open-source machine learning experiment tracking software. It makes it incredibly easy to spin up a local web interface to monitor your machine learning models, compare them, and stage them.

As you're experimenting to find the right modeling algorithm and architecture, it can be a nightmare to efficiently evaluate which one is best, especially when you're running hundreds of experiments at once. Ideally, you want a way to store each model run, its hyperparameters, its evaluation criteria, and more. MLflow makes this all possible with minimal code around your training code.

As you experiment with different modeling architectures, you can add new experiments and compare each one against the same criteria. Logging artifacts is extremely easy and fast. MLflow tracks and stores two main kinds of things: entities and artifacts.

Entities: runs, parameters, metrics, tags, notes, metadata, etc. These are stored in the backend store.

Artifacts: files, models, images, in-memory objects, model summaries, etc. These are stored in the artifact store.

The default storage location for both is the local filesystem, but you can configure the backend store to be a database such as SQLite (the minimum needed if you want to stage models to Staging or Production) or PostgreSQL (which enables user-authenticated access), and the artifact store to be remote object storage such as an S3 bucket. This page clarifies the different store configurations really well: https://mlflow.org/docs/latest/tracking.html#how-runs-and-artifacts-are-recorded
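To make the entities/artifacts split concrete, here is a minimal sketch of logging one run, reusing the baseline model and EDA report from earlier (the parameter, metric, and path names are illustrative):

import mlflow
import mlflow.xgboost

with mlflow.start_run(run_name="xgb-baseline"):
    # Entities (parameters, metrics) go to the backend store
    mlflow.log_param("max_depth", 4)
    mlflow.log_metric("validation_auc", val_auc)

    # Artifacts (files, serialized models) go to the artifact store
    mlflow.log_artifact("breast_cancer_eda.html")
    mlflow.xgboost.log_model(baseline, artifact_path="model")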

I won’t be going into the basics of how to use MLflow because their documentation covers a lot of what you’d need already. In case you need a quick tutorial for how to spin up MLflow, I highly recommend starting here: https://mlflow.org/docs/latest/tutorials-and-examples/tutorial.html

I will be leveraging a lightweight architecture that is independent of the cloud, but as long as you have read/write access to an S3 bucket, switching is as easy as configuring your AWS credentials and changing the artifact store path. I'll also be enabling a Tracking Server that exposes the runs and artifacts via a REST API so you're able to see the results in a web UI.

MLflow on localhost with Tracking Server architecture. Adapted from MLflow documentation.

HyperOpt is an open-source library for Bayesian optimization that helps find the right model hyperparameters and architecture. It is designed for large-scale optimization of models with hundreds of parameters and allows the optimization procedure to be scaled across multiple cores and multiple machines.

The different algorithms it can leverage are:

  • Random Search
  • Tree of Parzen Estimators
  • Annealing
  • Tree
  • Gaussian Process Tree

Most hyperparameter tuning I see done in practice is either manual or Grid Search. This exhaustive process can sometimes yield good results, but it's often highly expensive (compute and time) and unnecessary. What's more, Grid Search doesn't selectively pursue hyperparameters that perform better or worse when trying to find the global minimum of your loss function. It simply sweeps your entire search space.

Random Search is often a better baseline, as it can be faster and still provide good enough starting points to narrow your search space down. A better method still is Bayesian Optimization, because it takes prior runs into account at each iteration to guide future selections. Bayesian Optimization builds a probabilistic model over the hyperparameter space and uses it to find the hyperparameters that best minimize the loss function.

I've found the Tree of Parzen Estimators (TPE) to be a great default, but here's a paper that dives deeper into each of the algorithms: Algorithms for Hyper-Parameter Optimization (Bergstra et al.). TPE typically outperforms basic Gaussian-process Bayesian Optimization because it leverages a tree structure to traverse complex, conditional search spaces, including categorical hyperparameters, whereas standard Bayesian Optimization expects numerical values. TPE has been found to be extremely robust and efficient for large-scale hyperparameter optimization.

Michael Berk has written a phenomenal writeup on HyperOpt and its algorithms if you’re interested in a deeper dive: HyperOpt Demystified.

HyperOpt's fmin function is what ties all of this together. Here are its key parameters (a short sketch of putting them together follows the list):

  • fn: the objective (training) function to minimize
  • space: the hyperparameter search space
  • algo: the optimization algorithm
  • trials: a Trials object that records every run; it can be saved, passed to the built-in plotting routines, or analyzed with your own custom code
  • max_evals: the number of modeling experiments to run
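Putting those pieces together looks roughly like this. The search space ranges and the train_model name are my own assumptions (train_model is sketched after the trials discussion below); the original code's space may differ:

from hyperopt import fmin, tpe, hp, Trials

# Hypothetical search space over a few common XGBoost hyperparameters
search_space = {
    "max_depth": hp.quniform("max_depth", 3, 10, 1),
    "learning_rate": hp.loguniform("learning_rate", -5, 0),
    "min_child_weight": hp.loguniform("min_child_weight", -1, 3),
    "subsample": hp.uniform("subsample", 0.5, 1.0),
}

trials = Trials()
best_params = fmin(
    fn=train_model,      # objective function, sketched below
    space=search_space,
    algo=tpe.suggest,    # Tree of Parzen Estimators
    trials=trials,
    max_evals=50,        # 50 modeling experiments, as seen later in the MLflow UI
)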

HyperOpt's Trials object helps explain why you'd return a dictionary from the training function.

  • trials.trials – a list of dictionaries representing everything about the search
  • trials.results – a list of dictionaries returned by ‘objective’ during the search
  • trials.losses() – a list of losses (float for each ‘ok’ trial)
  • trials.statuses() – a list of status strings

Another thing to note: the return value of the training function is where you define the loss (which can be a custom metric if you choose). fmin always minimizes that value, so if you want to maximize a metric, as we do with ROC AUC score, multiply it by -1.
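A sketch of what the train_model function referenced in the fmin call above could look like for this workflow; the function and metric names are mine, and the author's version likely logs the model itself and more metrics:

from hyperopt import STATUS_OK
from sklearn.metrics import roc_auc_score
from xgboost import XGBClassifier
import mlflow

def train_model(params):
    # hp.quniform returns floats, so cast integer-valued hyperparameters
    params = {**params, "max_depth": int(params["max_depth"])}

    # nested=True lets each trial appear as its own run, even if fmin is
    # wrapped in a parent MLflow run
    with mlflow.start_run(nested=True):
        model = XGBClassifier(**params, random_state=42)
        model.fit(X_train, y_train)

        val_auc = roc_auc_score(y_val, model.predict_proba(X_val)[:, 1])
        mlflow.log_params(params)
        mlflow.log_metric("validation_auc", val_auc)

        # fmin minimizes the returned loss, so negate the metric we maximize
        return {"loss": -val_auc, "status": STATUS_OK}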

MLflow UI for analyzing ML experiments

We can see all 50 modeling runs as HyperOpt searches the hyperparameter space for the XGBoost model with the highest validation ROC AUC score. We can easily filter the columns to see different parameters or metrics, and change the sort order to rapidly analyze the results.

Model Evaluation & Registry

Let’s compare the top two modeling results.

Comparing the top two modeling results

We can choose different parameters and metrics to analyze via a Parallel Coordinates Plot, Scatter Plot, Box Plot, and Contour Plot to gauge how changes in parameters affect the metrics.

Furthermore, we can click into a single run to see all of the model artifacts saved for it, along with some code snippets for making predictions from this run.

Details of best model run

We can now load the best model and evaluate it on the test set before we register it.
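A sketch of loading and scoring the best run's model, assuming it was logged with mlflow.xgboost.log_model; the run ID is a placeholder, and the artifact path depends on how the model was logged:

import mlflow
from sklearn.metrics import (
    accuracy_score, precision_score, recall_score, f1_score, roc_auc_score
)

# Placeholder: take the best run's ID from the MLflow UI or mlflow.search_runs()
best_run_id = "<best-run-id>"
best_model = mlflow.xgboost.load_model(f"runs:/{best_run_id}/model")

y_pred = best_model.predict(X_test)
y_proba = best_model.predict_proba(X_test)[:, 1]

print(f"Testing Accuracy: {accuracy_score(y_test, y_pred):.3f}")
print(f"Testing Precision: {precision_score(y_test, y_pred):.3f}")
print(f"Testing Recall: {recall_score(y_test, y_pred):.3f}")
print(f"Testing F1: {f1_score(y_test, y_pred):.3f}")
print(f"Testing AUCROC: {roc_auc_score(y_test, y_proba):.3f}")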

Testing Accuracy: 0.982
Testing Precision: 1.0
Testing Recall: 0.971
Testing F1: 0.986
Testing AUCROC: 0.999

Looks great! We can now register the model into the Model Registry like so:
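A sketch of the registration call, reusing the best run's ID and artifact path from the evaluation step above (the registered model name matches the one in the log output below):

model_uri = f"runs:/{best_run_id}/model"
registered_model = mlflow.register_model(model_uri, "BreastCancerClassification-XGBHP")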

Successfully registered model 'BreastCancerClassification-XGBHP'.
2023/01/08 17:19:08 INFO mlflow.tracking._model_registry.client: Waiting up to 300 seconds for model version to finish creation. Model name: BreastCancerClassification-XGBHP, version 1
Created version '1' of model 'BreastCancerClassification-XGBHP'.

Now let’s update some information such as the description and the version information of the model.
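This is done through the MlflowClient; a minimal sketch, with the registered-model description abbreviated here and the version description taken from the output below:

from mlflow.tracking import MlflowClient

client = MlflowClient()

# Description for the registered model as a whole
client.update_registered_model(
    name="BreastCancerClassification-XGBHP",
    description="This model classifies breast cancer as malignant or benign ...",
)

# Description for this specific model version
client.update_model_version(
    name="BreastCancerClassification-XGBHP",
    version=1,
    description=(
        "This model version is the first XGBoost model trained with HyperOpt "
        "for bayesian hyperparameter tuning."
    ),
)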

<RegisteredModel: creation_timestamp=1673216348938, description=('This model classifies breast cancer as malignant or benign given certain '
'numerical features of cell nuclei such as \n'
' a) radius (mean of distances from center to points on the perimeter)\n'
' b) texture (standard deviation of gray-scale values)\n'
' c) perimeter\n'
' d) area\n'
' e) smoothness (local variation in radius lengths)\n'
' f) compactness (perimeter^2 / area - 1.0)\n'
' g) concavity (severity of concave portions of the contour)\n'
' h) concave points (number of concave portions of the contour)\n'
' i) symmetry\n'
' j) fractal dimension ("coastline approximation" - 1).'), last_updated_timestamp=1673216621429, latest_versions=[<ModelVersion: creation_timestamp=1673216348973, current_stage='None', description='', last_updated_timestamp=1673216348973, name='BreastCancerClassification-XGBHP', run_id='61c3dddaf07d4d5ab316f36e7f6d1541', run_link='', source='./mlruns/1/61c3dddaf07d4d5ab316f36e7f6d1541/artifacts/artifacts/model', status='READY', status_message='', tags={}, user_id='', version='1'>], name='BreastCancerClassification-XGBHP', tags={}>
<ModelVersion: creation_timestamp=1673216348973, current_stage='None', description=('This model version is the first XGBoost model trained with HyperOpt for '
'bayesian hyperparameter tuning.'), last_updated_timestamp=1673216628186, name='BreastCancerClassification-XGBHP', run_id='61c3dddaf07d4d5ab316f36e7f6d1541', run_link='', source='./mlruns/1/61c3dddaf07d4d5ab316f36e7f6d1541/artifacts/artifacts/model', status='READY', status_message='', tags={}, user_id='', version='1'>

Once we know we want to push the model to production, we can do this step easily in MLflow as well. One [very] important thing to note: MLflow doesn't have great access controls for this by default, so promoting models this way is very much not the recommended approach in a real setting; you don't want just anyone to be able to push a model to production. I've typically found that the Staging and Production environments for modeling live in different places (i.e., in the cloud) so you can manage permissions and access better.
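The promotion itself is a single client call; a sketch, continuing with the client from above:

# Transition version 1 of the registered model to the Production stage
client.transition_model_version_stage(
    name="BreastCancerClassification-XGBHP",
    version=1,
    stage="Production",
)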

<ModelVersion: creation_timestamp=1673216348973, current_stage='Production', description=('This model version is the first XGBoost model trained with HyperOpt for '
'bayesian hyperparameter tuning.'), last_updated_timestamp=1673217222517, name='BreastCancerClassification-XGBHP', run_id='61c3dddaf07d4d5ab316f36e7f6d1541', run_link='', source='./mlruns/1/61c3dddaf07d4d5ab316f36e7f6d1541/artifacts/artifacts/model', status='READY', status_message='', tags={}, user_id='', version='1'>

What's the value of pushing to Production here? Keep in mind this is version 1 from our experimentation phase, which happened to perform great. Over time we may acquire more data or better knowledge of modeling techniques that could beat our version 1 model. When that happens and we run a second experiment, we can evaluate the second experiment's results against the model in Production and gauge whether or not to replace it. This is when we would want to leverage the "Staging" stage between "None" and "Production".

In this story, we started with the UCI ML Breast Cancer Wisconsin (Diagnostic) dataset and walked through a standard supervised learning workflow for structured, tabular data (on a simple dataset) that leveraged:

  • pandas-profiling for creating an Exploratory Data Analysis report
  • scikit-learn for preprocessing
  • XGBoost for model training
  • HyperOpt for hyperparameter tuning
  • MLflow for experiment tracking, model evaluation, model logging/versioning, and model registry

Hope this helps you jumpstart your journey into the MLOps world and levels up your machine learning workflows!

[1] Breast Cancer Wisconsin (Diagnostic) Data Set (Wolberg, Street, Mangasarian)

[2] Designing Machine Learning Systems (Huyen)

[3] MLflow

[4] HyperOpt

[5] HyperOpt Demystified (Berk)

[6] Algorithms for Hyper-Parameter Optimization (Bergstra et al.)

All images unless otherwise noted are by the author.

