
An End to End Web App to detect anomalies from ECG signals with Streamlit | by Eugenia Anello | Nov, 2022



This tutorial focuses on building a web application with MLflow, SageMaker, and Streamlit

Photo by Michael Fenton on Unsplash

This article is a continuation of the story How to deploy your ML model using DagsHub+MLflow+AWS Lambda. In the previous story, I showed how to train and deploy a model to detect irregular heart rhythms from ECG signals. The benchmark dataset was the ECG5000 dataset, which contains 5000 heartbeats randomly selected from a patient who had heart failure. It’s been frequently used in research papers and tutorials.

This time I am going to use another real-world dataset, which is much noisier and, consequently, more challenging. The data contains fields like timestamp, patient ID, and heart rate. In addition to these features, there is a manually annotated label that tells you whether the heart rhythm is anomalous or not.

In this tutorial, we are going to build a web application that detects heart anomalies. Before creating the app, we are going to do some exploratory analysis, build and compare different machine learning models. Let’s get started!

Illustration by Author.

We are going to focus again on detecting anomalies from ECG signals. Above are the ECG signals of a patient, where the x markers represent the anomalies. From this example, you can see that this is not only an anomaly detection problem but also a peak detection one: we need to identify peaks and establish whether those peaks are anomalous.

To respect the anomaly detection formulation, the training set contains only normal ECG signals, while the test set contains both normal and anomalous signals. There is also an important consideration to make: the anomalies constitute the minority class and correspond to peaks. Indeed, the test set contains less than 1% anomalies.

For these reasons, the heart rate alone is not enough to solve this problem. In addition to the heart rate, we need to create two new features. First, we build a variable that calculates the difference between the current value of the heart rate and the previous value. Another crucial feature is the peak label, which takes a value of 1 if there is a peak and 0 otherwise.
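A minimal sketch of this feature engineering with pandas, assuming a `heart_rate` column and a simple threshold-based notion of a peak (the column names, threshold, and peak criterion are illustrative; the actual peak-detection logic may differ):

```python
import pandas as pd

def add_features(df: pd.DataFrame, hr_col: str = "heart_rate",
                 threshold: float = 10.0) -> pd.DataFrame:
    """Add the two engineered features described above.

    - hr_diff: difference between the current heart-rate value and the previous one
    - is_peak: 1 if the sample is a local maximum whose jump from the previous
      value exceeds `threshold`, else 0 (a simple stand-in for peak detection)
    """
    out = df.copy()
    out["hr_diff"] = out[hr_col].diff().fillna(0.0)
    prev, nxt = out[hr_col].shift(1), out[hr_col].shift(-1)
    local_max = (out[hr_col] > prev) & (out[hr_col] >= nxt)
    out["is_peak"] = (local_max & (out["hr_diff"] > threshold)).astype(int)
    return out

# Toy example: the 120 bpm sample is flagged as a peak.
hr = pd.DataFrame({"heart_rate": [72, 74, 73, 120, 75, 74]})
print(add_features(hr))
```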

Part 1: Model training and MLflow tracking

Like in the previous tutorial, we are going to use MLflow, an awesome open-source platform to track experiments, package machine learning models, and deploy them in production. It’s used together with DagsHub, which lets you find all the resulting experiments in your repository and version your data and code efficiently. In addition to these features, you can visualize an interactive graph of the entire pipeline on the DagsHub repository.

This time, I considered two models to detect heart anomalies: an autoencoder and an Isolation Forest. This choice is due to the fact that the task is very challenging and the anomalies are very few compared to the normal observations. The Isolation Forest performed better than the autoencoder because its core assumptions fit the problem: anomalies represent the minority class and have short average path lengths in the isolation trees.

Moreover, the Isolation Forest requires setting, before training, a hyperparameter that corresponds to the proportion of anomalies in the dataset, known as contamination. It achieved the best performance with contamination rates smaller than 1%, such as 0.4% and 0.5%. It shouldn’t be a surprise that the autoencoder finds this task more problematic, since it doesn’t make these assumptions.

In the script, we train one of the two available models and log hyperparameters and metrics. We are also interested in logging the model as an artifact and registering the model after it’s trained. These two operations can be merged using mlflow.sklearn.log_model(sk_model, artifact_path, registered_model_name=...). If you don’t want to register the model yet, simply omit the registered_model_name parameter.

You can find the full code of train.py here.

We can run the code and try different combinations of hyperparameters, such as model_name (Isolation Forest or autoencoder), the contamination rate when using the Isolation Forest, and the number of epochs and batch size when switching to the autoencoder.

python src/train.py

We can access the results of the experiments in the Experiments tab of the DagsHub repository. From the evaluation of the models on the test set, you can notice that the autoencoder obtains very low precision and f1-score values, while recall is high at 88%. This means the number of false positives is high: the autoencoder flags even normal patterns as anomalous.

Unlike the autoencoder, the Isolation Forest reached better scores: 63% precision and 85% recall, leading to an f1-score of 73%. Even if there are still some false positives, the results exceeded expectations, considering how challenging the problem is.

To have a better comprehension of the evaluation measures obtained, let’s also visualize the true values versus the predictions. This is the plot achieved with Autoencoder on the ECG signals of a patient:

This is compared to the same plot obtained with Isolation Forest:

The green points represent the predicted labels, while the red crosses represent the ground truth. It seems clear that the autoencoder flags many observations as anomalous, while the Isolation Forest singles out only the most anomalous samples.

Part 2: Deploy MLflow model with Amazon SageMaker

Since the Isolation Forest achieved the best performance, we are going to focus only on this algorithm. The deployment can be split into three steps:

  • Update Stage of MLflow Model
  • Create an AWS account and Set up an IAM Role
  • Deploy the MLflow model to a Sagemaker Endpoint

1. Update Stage of MLflow Model

From the DagsHub page, it’s possible to access the MLflow server user interface. We just have to click the Remote button at the top right and select “Go to MLflow UI”. Then, we can open the Models tab from the menu, which leads to the Registered Models page. Finally, we click the latest version of the registered model and set the Stage parameter from None to Staging.

Instead of doing this operation manually, you can also use a Python script directly:

Now that we have updated the stage of the model, we are ready to move to the next step!

2. Create an AWS Account and Set up an IAM role

Before deploying the model, there are some requirements that need to be respected:

  • Register an account in AWS
  • Go to IAM → Users → Add Users. Choose the name of the user and select Access Key — Programmatic Access as AWS Credential Type.
  • In case you don’t have a SageMaker group, click Create Group. Choose the name of the group and add two policies: AmazonSageMakerFullAccess and AmazonEC2ContainerRegistryFullAccess.
  • Then, go to Roles → Create Role. Select AWS service and SageMaker. The SageMaker policy will be automatically attached to the role.
Screenshot by Author
  • Set up the AWS CLI. Run aws configure in the terminal, which will ask you for the AWS Access Key ID, AWS Secret Access Key, Default region name, and Default output format. For additional information, check here. This is an example shown in the AWS documentation:
$ aws configure 
AWS Access Key ID [None]: AKIAIOSFODNN7EXAMPLE
AWS Secret Access Key [None]: wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
Default region name [None]: us-west-2
Default output format [None]: json

I want to highlight that these configured credentials are essential for the next step.

3. Deploy MLflow Model to a Sagemaker Endpoint

The first step consists of building the mlflow-pyfunc image and pushing it to Amazon ECR using the MLflow CLI:

mlflow sagemaker build-and-push-container

You can find the resulting images both in your AWS account and in Docker Desktop.

Then, we can use a Python script to deploy the MLflow model to SageMaker using mlflow.sagemaker.deploy. You can find additional information in the MLflow documentation.
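A hedged sketch of that script, using the mlflow.sagemaker.deploy API from MLflow 1.x (the version current when this tutorial was written); every argument value below is a placeholder:

```python
def deploy_model(app_name: str, model_uri: str, image_url: str, role_arn: str,
                 region: str = "us-west-2") -> None:
    """Sketch of deploying the staged model with MLflow 1.x's SageMaker API.

    model_uri can point at the registry stage, e.g.
    "models:/ecg-anomaly-detector/Staging" (hypothetical name). In MLflow 2.x
    this API moved to mlflow.deployments.get_deploy_client("sagemaker").
    """
    import mlflow.sagemaker  # imported here; only needed when actually deploying

    mlflow.sagemaker.deploy(
        app_name=app_name,            # becomes the SageMaker endpoint name
        model_uri=model_uri,
        image_url=image_url,          # ECR image from build-and-push-container
        execution_role_arn=role_arn,  # the IAM role created in the previous step
        region_name=region,
        mode="create",                # fail if the endpoint already exists
    )
```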

This will create a SageMaker endpoint.

You should obtain this final output. If you get errors, check that you specified the image_url, model_uri, and role ARN correctly.

We can quickly check whether the endpoint is working. The ECG signals of a patient are selected and passed to the deployed model, which classifies each observation as normal or anomalous.

It’s important to note that the Isolation Forest returns -1 when an observation is anomalous and 1 otherwise. To compare it with the ground truth, we need to map the values to 1 for anomalous and 0 for normal. If you run the script, you should obtain an output like this:
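A sketch of this check, assuming boto3 and the pandas-split JSON format expected by the MLflow scoring server of that era (the endpoint name and payload details are placeholders):

```python
import json

def query_endpoint(app_name: str, payload_json: str, region: str = "us-west-2"):
    """Send a pandas-split JSON payload to the SageMaker endpoint (names assumed)."""
    import boto3  # only needed when actually calling AWS

    client = boto3.session.Session().client("sagemaker-runtime", region)
    response = client.invoke_endpoint(
        EndpointName=app_name,
        Body=payload_json,
        ContentType="application/json; format=pandas-split",
    )
    return json.loads(response["Body"].read().decode("ascii"))

def to_binary_labels(predictions):
    """Map Isolation Forest output (-1 = anomalous, 1 = normal) to 1/0 labels."""
    return [1 if p == -1 else 0 for p in predictions]

print(to_binary_labels([1, -1, 1, 1, -1]))  # → [0, 1, 0, 0, 1]
```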

Part 3: Create a Web Application

Let’s finally build a web application to detect anomalies from ECG signals! We are going to use Streamlit, a free, open-source framework that lets you build applications in a few lines of Python:

  • We set up the link to the API service obtained with AWS Lambda, in which the model has been deployed.
  • The main title of the web app is shown using st.markdown.
  • A left sidebar is created using st.sidebar, with a file uploader for the CSV file used to evaluate the performance of the model.
  • The button “Check Anomalies!” needs to be clicked to display the results of the deployed model.

If the file is uploaded and the button is clicked, a scatterplot representing the ECG signals of a patient will appear. Like before, it compares the true values with the predictions. In addition to these features, you can also change the patient ID from a sidebar widget.

After you push all changes to GitHub, we can deploy the app using Streamlit. It’s very straightforward; if you want more info, watch this YouTube video. The link to my deployed app is here.

Final thoughts:

Congratulations! You reached the end of this project focused on detecting anomalies from ECG signals. Switching from one tool to another can be overwhelming at first, but reaching results is satisfying! In particular, a web application can be a cute and intuitive way to share your work with other people. Thanks for reading. Have a nice day!

Check out the code in my DagsHub repository:


