Techno Blender
Digitally Yours.
Browsing Tag

Dec

Bayesian vs. Frequentist Inference | by Dimitris Poulopoulos | Dec, 2022

Are you Bayesian or Frequentist?Image generated using Stable DiffusionRegarding the probability school of thought, there are two main approaches: the Bayesian and the frequentist approaches. Both of these approaches have their own strengths and weaknesses, and they are often used in different circumstances to help determine the probability of events.So, what are their differences? Is one better than the other? This is a wrong question to ask; Bayesian thinking is based on the idea that probabilities represent a degree of…

Use-Case-Driven Data Readiness as an Instrument to Manage and Control Data Quality | by Murali Kashaboina | Dec, 2022

A data readiness-based approach to addressing data qualityPhoto by Michael Dziedzic on UnsplashTL;DR: Data quality should be viewed through its readiness for specific consumption use cases. A data readiness approach is proposed to establish specific data consumption context through a specific use case to improve the data quality. Data readiness for a specific use case can be tested just like how software use case gets tested using various test-case scenarios. Similar to data dictionaries, a data readiness artifact must…

Python Square Roots: 5 Ways to Take Square Roots in Python | by Dario Radečić | Dec, 2022

There’s more than one way to Python square roots. Learn 5 approaches to square roots in Python + some bonus advanced tricks.Photo by Dan Cristian Pădureț on UnsplashAfter learning about 4 ways to square a number in Python, now it’s time to tackle the opposite operation — Python square roots. This article will teach you five distinct ways to take square roots in Python and will finish off with a bonus section on cube roots and square roots of Python lists.Let’s get started by introducing the topic and addressing the…

Why and How to Deprecate Dashboards | by Sarah Krasnik | Dec, 2022

When there are too many dashboards to sift through, establish an automated deprecation process to keep your BI instance clean.Your dashboards aren’t this critical. Photo from UnsplashHow many dashboards can one person reasonably gain insight from and act upon? While I don’t know the answer (this would actually be a great survey), I hope we can all agree the limit does exist. When we run before we walk and create a dashboard for every question asked without analyzing the question itself, that’s when dashboard bloat…

A Sentence Is Greater than the Sum of Its Words | by Elad Rapaport | Dec, 2022

Sentence Embeddings 101Image by Ri Butov from PixabayIn this article, I will present the topic of sentence embeddings. We will learn what sentence embeddings are and how they can be helpful. We will review some basic (and even naive) methods for creating sentence embeddings and also some recent advances in the field. We will end with an experiment I conducted on measuring document visualization quality when using different sentence embedding techniques. So without further ado, let’s jump right in!The article will be…

Pandas and Python Tips and Tricks for Data Science and Data Analysis | by Zoumana Keita | Dec, 2022

Take your efficiency to the next level with these Pandas and Python Tricks!Photo by Andrew Neel on UnsplashThis blog regroups all the Pandas and Python tricks & tips I share on a basis on my LinkedIn page. I have decided to centralize them into a single blog to help you make the most out of your learning process by easily finding what you are looking for.The content is is divided into two main sections:Pandas tricks & tips are related to only Pandas.Python tricks & tips related to Python.This section provides…

Significantly Increase Your Grid-Search Results With These Parameters | by Tomer Gabay | Dec, 2022

Grid search over any machine learning pipeline step using an EstimatorSwitchPhoto by Héctor J. Rivas on UnsplashA very common step in building a machine learning model is to grid search over a classifier’s parameters on the train set, using cross-validation, to find the most optimal parameters. What is less known, is that you can also grid search over virtually any pipeline step, such as feature engineering steps. E.g. which imputation strategy works best for numerical values? Mean, median or arbitrary? Which categorical…

9 Myths about Starting in Data Science | by Kamila Hamalcikova | Dec, 2022

Thinking about career change into data? Check common prejudices about getting job as a junior.One of my first data analysis project was fun. Photo: Czechitas (https://www.czechitas.cz/en), published with permission of Czechitas z.ú.Are you thinking about a new career in a promising data science field? If a former coal miner or opera singer can do it, you can probably do it too.I have been there, not in the mines though, nor torturing an audience with my high pitched voice. I jumped from writing articles about electric…

Basic Forecasting Techniques. An overview of some very simple… | by Egor Howell | Dec, 2022

An overview of some very simple forecasting modelsPhoto by Mark König on UnsplashForecasting is a wide domain with numerous applications in almost every industry. Due to this, the range of forecasting models is also very large with each model having its own pros and cons.In this article, I want to go over some basic and simple forecasting models. Despite their simplicity, these models can offer good results in practice and provide a good basis to iterate from.The first model we will consider is the average forecast. This…

Monte Carlo Simulation. Part 7: Charting | by Darío Weitz | Dec, 2022

Part 7: ChartingPhoto by Efe Kurnaz on UnsplashThis is the seventh article related to a numerical technique known as Monte Carlo Simulation. We will reiterate our definition, as stated in previous articles: “A Monte Carlo Simulation (MCS) is a sampling experiment whose aim is to estimate the distribution of a quantity of interest that depends on one or more stochastic input variables”.I encourage you to read some of my previous articles (MCS Part 1, MCS Part2, MCS Part3) to learn more aspects of the technique, and in…