Techno Blender
Digitally Yours.
Browsing Tag: Angelica

Building a Data Science Team. Successful data team: From who to hire… | by Angelica Lo Duca | Jul, 2022

Data Science
Successful data team: From who to hire first, to how to pick the next projects to work on
Photo by Hannah Busing on Unsplash
Working as a data scientist is one of the most in-demand jobs today. Companies are looking for data scientists to help them make better decisions, drive growth, and improve operational efficiency. But building a data science team from scratch can be a challenge. In this article, we’ll explore some tips on how to build an effective data science team.
The article is organized as follows: how…

What DataOps is exactly. An overview of DataOps and what makes… | by Angelica Lo Duca | Jun, 2022

Software Deployment
An overview of DataOps and what makes it different from the other DevOps practices
Photo by fabio on Unsplash
It’s hard to overstate the importance of data in modern enterprises. As a new buzzword, DataOps is aimed at helping organizations overcome obstacles in their data analytics processes. But what exactly is this emerging practice and how can it help businesses better leverage their data? In this article, we’ll explore how important DataOps has become by looking at its various aspects and examining…

How to Install Spark NLP. A step-by-step tutorial on how to make… | by Angelica Lo Duca | Jun, 2022

Environment Setup
A step-by-step tutorial on how to make Spark NLP work on your local computer
Photo by Sigmund on Unsplash
Apache Spark is an open-source framework for fast and general-purpose data processing. It provides a unified engine that can run complex analytics, including Machine Learning, in a fast and distributed way.
Spark NLP is an Apache Spark module that provides advanced Natural Language Processing (NLP) capabilities to Spark applications. It can be used to build complex text processing pipelines, including…

How to Perform Exploratory Data Analysis in QGis | by Angelica Lo Duca | May, 2022

Geographical data, Exploratory Data Analysis
A tutorial on how to use QGis to perform EDA of geographical data which is provided as CSV.
Photo by Annie Spratt on Unsplash
A data scientist may have to deal with geographic data. In this case, in addition to the classic methods of data exploration, you can also use specific tools for managing geographic data.
In this article, I describe how to use QGis to perform a preliminary exploration of a geographic dataset. As an example, I use the dataset relating to the mobility of taxi…

How to Build a Web App with Data Ingested through Versatile Data Kit | by Angelica Lo Duca | May, 2022

Database, Web applications
A step-by-step tutorial on how to build a Web application, combining the Streamlit Python library and Versatile Data Kit.
Photo by Taras Shypka on Unsplash
In the last few months, I have contributed to a very interesting project, called Versatile Data Kit (VDK). VDK is a very powerful framework for ingesting data in different formats into a database, including CSV files, JSON objects, and even data provided by REST API services.
VDK is compatible with several types of databases, including but not…