From Clusters To Insights; The Next Step | by Erdogan Taskesen | May, 2023
For this use case, we will load the online shoppers’ intentions data set and go through the steps of preprocessing, clustering, evaluation and then determining the significantly associated features for the cluster labels. This data set contains in total of 12330 samples with 18 features. This mixed dataset requires a few more pre-processing steps to make sure that all variables have similar types or units of measurement. Thus, the first step is to create homogeneous data sets with units that are comparable. A common…