Techno Blender
Digitally Yours.
Browsing Tag

Keita

Plagiarism Detection Using Transformers | by Zoumana Keita | Dec, 2022

A complete guide to building a more robust plagiarism detector using transformer-based models.Image by Agence Olloweb on UnsplashPlagiarism is one of the biggest issues in many industries, especially in academia. This phenomenon has even worsened with the rise of the internet and open information, where anyone can access any information at a click about a specific topic.Based on this observation researchers have been trying to tackle the issue using different text analysis approaches. In this conceptual article, we will…

How to Create a Virtual Environment and Use it on Jupyter Notebook | by Zoumana Keita | Dec, 2022

Use your virtual environment inside Jupyter NotebookPhoto by Glenn Carstens-Peters on UnsplashCreating a virtual environment is a good practice. It isolates the dependencies of a specific project from the rest of the packages globally installed on your operating system. As a Data Scientist, this can be beneficial when working on different projects which generally require different dependencies.At the end of this conceptual blog, you will be able to:(1) Create virtual environments using virtualenv , and conda(2) Connect…

How to Perform Speech-to-Text and Translate Any Speech to English With OpenAI’s Whisper | by Zoumana Keita | Dec, 2022

How to use cutting-edge NLP models for audio transcription to text and machine translation.Image by Jonathan Velasquez on UnsplashOpenAI is a pure player in the field of Artificial Intelligence and has made accessible to the community many AI models including GPT, CLIP, etc.Open-sourced by OpenAI, the Whisper models are considered to have approached human-level robustness and accuracy in English speech recognition.This article will try to walk you through all the steps to transform long pieces of audio into textual…

Most Common Text Processing Tasks In Natural Language Processing | by Zoumana Keita | Dec, 2022

Computers haven’t human capability, text data need processing for better understand and interpretationPhoto by Yannick Pulver on UnsplashHuman being has the capability to understand written textual information. Machines on the other hand do not have that intrinsic capability. Here is where text processing becomes important because it allows those machines to understand and analyze natural languages.In this conceptual article, we will explain how to perform the most common text-processing tasks using popular Python…

Algorithmic Bias in Healthcare and Some Strategies for Mitigating It | by Zoumana Keita | Dec, 2022

Watch out for the biases in your AI systems: it’s sometimes a matter of life or deathImage by National Cancer Institute on UnsplashI don’t know about you, but my first contact with AI was in movies full of intelligent robots. Their actions made me believe that the whole world would be conquered by those angry machines.But my belief has changed ever since because now, I believe that AI can do better than that trying to destroy the world.AI can improve healthcare and save millions of lives around the world.There are…

How to Run SQL Queries On Your Pandas DataFrames With Python | by Zoumana Keita | Dec, 2022

Run SQL queries in your Python Pandas DataframeImage by Caspar Camille Rubin on UnsplashPandas is being increasingly used by Data Scientists and Data Analysts for data analysis purposes, and it has the advantage of being part of the wider Python universe, making it accessible to many people. SQL on the other hand is known for its performance, being human-readable, and can be easily understood even by non-technical people.What if we could find a way to combine the benefits of both Pandas and SQL statements? Here is where…

Pandas and Python Tips and Tricks for Data Science and Data Analysis | by Zoumana Keita | Dec, 2022

Take your efficiency to the next level with these Pandas and Python Tricks!Photo by Andrew Neel on UnsplashThis blog regroups all the Pandas and Python tricks & tips I share on a basis on my LinkedIn page. I have decided to centralize them into a single blog to help you make the most out of your learning process by easily finding what you are looking for.The content is is divided into two main sections:Pandas tricks & tips are related to only Pandas.Python tricks & tips related to Python.This section provides…

Avoid Using “pip freeze” — Use “pipreqs” instead | by Zoumana Keita | Nov, 2022

Your project dependencies are important — manage them efficientlyImage by Ankhesenamun on UnsplashPackage management is one of the best practices of software development workflow because it facilitates the automation of software delivery.Nowadays, most Data Scientists and Machine Learning Engineers have been adopting this best practice for their pipeline automation. Even though this process is considered a good practice, the approach adopted by most practitioners might not always be efficient: the use of pip freeze.In…

The guide to choosing the right database for my project: MongoDB vs. MySQL | by Zoumana Keita | Sep, 2022

This article is a comprehensive guide to help you adopt the right type database for your use caseImage by Daniil Silantev on UnsplashConstant changes in data types such as relation and non-relational are one of many challenges faced by companies in their journey of implementing data-driven applications, which consequently makes it harder to choose the right type of database.This conceptual blog will try to make your decision-making process less tedious, with a special focus on MongoDB and MySQL, two of the most popular…

Data Preprocessing Using Pipeline in Pandas | by Zoumana Keita | Sep, 2022

Write better code using Pandas Pipe functionImage by Samuel Sianipar on UnsplashData in real life goes through many preprocessing phases such as quality assessment, cleaning, transformation, reduction, etc. Most of the time these steps are performed in a way that is not always efficient when using Pandas.What if we could have an approach that can execute all the processing functions in a chain and in the most efficient manner? This is where Pandas’s Pipe comes in handy.In this short article, we will first understand what…