How to Use UMAP For Much Faster And Effective Outlier Detection | by Bex T. | Sep, 2022
Let’s catch those high-dimensional outliers

Photo by João Jesus

We’ve all used those simple techniques: plot a scatterplot or a KDE, and the data points farthest from the group are outliers. Now, tell me: how would you use these methods if you were to find outliers in, say, 100-dimensional datasets? Right off the bat, visual outlier detection methods are out of the question.

So, fancy machine learning algorithms like Local Outlier Factor or Isolation Forest come to mind, which are effective against outliers in…