Apache Spark MLlib vs Scikit-learn: Building Machine Learning Pipelines | by Bruno Caraffa | Mar, 2023
Code implementations for ML pipelines: from raw data to predictionsPhoto by Rodion Kutsaiev on UnsplashReal-life machine learning involves a series of tasks to prepare the data before the magic predictions take place. Filling the missing values, one hot encoding for the categorical features, standardization and scaling for the numeric ones, feature extraction, and model fitting are just some of the stages that take part during a machine learning project before making any predictions. When working with NLP applications it…