Techno Blender
Digitally Yours.
Browsing Tag

Angelica

Netflix’s 2024 Previews Tease Squid Game Season 2, Avatar’s Arrival, and More

Image: NetflixLOGLINE: Orion seems a lot like your average elementary school kid—shy, unassuming, harboring a secret crush. But underneath his seemingly normal exterior, Orion is a ball of adolescent anxiety, completely consumed by irrational fears of bees, dogs, the ocean, cellphone waves, murderous gutter clowns, even falling off a cliff. But of all his fears, the thing he’s the most afraid of is what he confronts on a nightly basis: the dark. So when the literal embodiment of his worst fear pays a visit, Dark whisks

How to Keep Track of Data Versions Using Versatile Data Kit | by Angelica Lo Duca | May, 2023

Data EngineeringLearn about slow change dimensions (SCD) and how to implement SCD Type 2 in VDKPhoto by Joshua Sortino on UnsplashData is the backbone of any organization, and in today’s fast-paced world, it is crucial to keep track of its versions. As businesses grow and evolve, data undergoes numerous changes that can quickly become overwhelming without a streamlined system. Technology has come to our rescue with various tools that help us manage data efficiently.One such tool is the Versatile Data Kit (VDK), which…

How to Create a Custom NER in Spacy 3.5 | by Angelica Lo Duca | Apr, 2023

Natural Language ProcessingA quick tutorial on extracting custom entities from a textPhoto by Max Chen on UnsplashAre you tired of using generic named entity recognition (NER) models that don’t quite fit your specific needs? Look no further! This article will guide you through creating a custom NER in Spacy 3.5.With a few tweaks and training data, you can have a model that accurately identifies entities specific to your domain or use case. Say goodbye to one-size-fits-all NER models and hello to customized precision.…

3 Ways to Build a Geographical Map in Python Altair | by Angelica Lo Duca | Jan, 2023

Data Visualization, Geographical DataA data visualization tutorial on how to build three different maps in Python Altair: choropleth map, dot density map, and proportional symbol mapPhoto by Kyle Glenn on UnsplashFor data scientists, visualizing data is an essential skill. It helps to quickly understand patterns and correlations in the data that would otherwise be missed. Geographical maps are a great way to visualize spatial data and can be used to explore trends in different countries or regions.This article will show…

3 Ways to Visualize Time Series You May Not Know | by Angelica Lo Duca | Nov, 2022

Data Visualization, Time SeriesA ready-to-run tutorial on how to visualize a time series in Python and AltairPhoto by Anton Maksimov 5642.su on UnsplashA time series is a sequence of data points, typically measured at discrete time intervals. We can represent time series data using various methods, including line graphs, bar graphs, and scatter plots. In Python, we can use libraries like matplotlib, seaborn, and Altair to create time series plots.There are several ways to display time series data, such as line charts and…

How to Create a Data Formatting Plugin in VDK | by Angelica Lo Duca | Oct, 2022

Data Engineering, Data LakeA step-by-step tutorial on how to manipulate a table in your data lake by writing a VDK custom pluginPhoto by Campaign Creators on UnsplashVersatile Data Kit (VDK) is a framework for ingesting and manipulating different data sources into a single data lake. I’ve already discussed VDK in my previous article, so for an introduction to it, you can refer there.In this article, I’ll discuss how to implement a custom plugin in VDK. As a use case, I’ll implement a simple plugin that takes input from an…

Handling Missing Values in Versatile Data Kit | by Angelica Lo Duca | Aug, 2022

Data PreprocessingA tutorial on how to build data pipelines using VDK to handle missing valuesPhoto by Markus Spiske on UnsplashVMware has recently released a new framework, Versatile Data Kit (VDK), which you can use for Data Ingestion and Data Processing. VDK helps you to easily perform complex operations, such as data ingestion from different sources, using either SQL or Python. In other words, you can use VDK to build data lakes, where you ingest raw data extracted from different sources, including structured,…

The Essentials of Public Speaking for a Career in Data Science | by Angelica Lo Duca | Aug, 2022

Data ScienceSome tips on how to boost your public speaking skills as a data scientistPhoto by Kane Reinholdtsen on UnsplashData science is a relatively new field, and public speaking is an essential skill for anyone looking to build a career in this field. The ability to communicate complex data concepts clearly and effectively is essential for success in data science.If you’re data-driven and looking to boost your career in science, public speaking is a great way to get noticed. But how exactly do you do that? We’ve got…

Starting a Career as a Data Scientist | by Angelica Lo Duca | Jul, 2022

Data ScienceTips on what your Data Science career could look likePhoto by Razvan Chisu on UnsplashData science has become one of the most in-demand fields in recent years. As companies increasingly collect and generate data, they need individuals who can sift through this information to identify trends and make predictions.While a data scientist’s job may vary depending on the industry, their responsibilities usually fall into one of three categories: Type A, Type B and Type C. In this article, we’ll explore the three…

Data Science Manager vs Data Science Expert | by Angelica Lo Duca | Jul, 2022

Data ScienceWhich profile does your company need? A comparison between the two roles: skills, responsibilities, and their role in the team.Photo by krakenimages on UnsplashThere’s a lot of debate in the tech world about whether data science managers or data science experts are more valuable to organizations. Some believe that managers are better able to develop and implement strategy, while others argue that experts are more skilled at working with data.So, which is it? In this article, we’ll explore the pros and cons of…