Browsing Tag

Ransaka

Gaussian Mixture Model Clearly Explained | by Ransaka Ravihara | Jan, 2023

Jessie Hobb Jan 11, 2023 0

The only guide you need to learn everything about GMMPhoto by Planet Volumes on UnsplashWhen we talk about Gaussian Mixture Model (later, this will be denoted as GMM in this article), it's essential to know how the KMeans algorithm works. Because GMM is quite similar to the KMeans, more likely it's a probabilistic version of KMeans. This probabilistic feature allows GMM to be applied to many complex problems that KMeans can't fit into.In summary, KMeans have below limitations,It assumed that the clusters were spherical…

Use SMOTE with Caution. The guide on when to use and when not… | by Ransaka Ravihara | Jan, 2023

Jessie Hobb Jan 6, 2023 0

The guide on when to use and when not to use synthetic data to tackle the class imbalancePhoto by 은 하 on UnsplashIf you are a machine learning practitioner, you may face class imbalance problems more often. Class imbalance happens when there is a different class distribution in the dataset. Let's take an example. Assume we are working on a churn problem. In this specific scenario, our minor and majority classes are customer churn, and the customer stays with the current service provider. But if you explore this problem…

Machine Learning on Snowflake. snowflake is one of the leading data… | by Ransaka Ravihara | Dec, 2022

Jessie Hobb Jan 4, 2023 0

Snowflake is one of the leading data platforms out there. In this article, we will explore the capabilities of its’ snowpark Python libraryPhoto by Aaron Burden on UnsplashThroughout this article, you will learn how to use,snowpark-python functionalities for primary data preprocessingTrain and deploy machine learning models in SnowflakeDefine UDFs in a pythonic way and deploy them in snowflakeTopics covered in this article | Image by authorIf you are willing to follow along with the tutorial, you should have an Anaconda…

Quick and Easy Way to Debug SageMaker Pipelines | by Ransaka Ravihara | Oct, 2022

Jessie Hobb Oct 19, 2022 0

A brief introduction to AWS SageMaker local modePhoto by EJ Strat on UnsplashIntroductionAs you may have experienced, debugging a pipeline is time-consuming in a cloud environment. Since our programs interact with cloud-hosted services, we have many points to consider compared to the local environment. For example, imagine your pipeline failed due to a version issue in the last step of the pipeline. Maybe it has been running for 2 hours before failing. In that case, your money on a particular cloud resource is wasted.…

Learning to Rank for Product Recommendations | by Ransaka Ravihara | Sep, 2022

Jessie Hobb Sep 3, 2022 0

This article will go through how to use the popular XGBoost library for Learning-to-rank(LTR) problemsPhoto by Malvestida on UnsplashThe most common use cases of LTR are Search Engines and Recommender Systems. The ultimate goal of ranking is to order items in a meaningful order. This article will use the popular XGBoost library for movie recommendations.When starting working on LTR, my first question was, what is the difference between traditional machine learning and ranking problems? So this is what I found. Each…