Techno Blender
Digitally Yours.
Browsing Tag

Data set

Search for Rail Defects (Part 3)

To ensure the safety of rail traffic, non-destructive testing of rails is regularly carried out using various approaches and methods. One of the main approaches to determining the operational condition of railway rails is ultrasonic non-destructive testing. The assessment of the test results depends on the defectoscopist. The need to reduce the workload on humans and improve the efficiency of the process of analyzing ultrasonic testing data makes the task of creating an automated system relevant. The purpose of this work…

Building an SLM With the Jaro-Winkler Algorithm

Spelling errors are a common problem for many people. They can be caused by a variety of factors, such as typos, mishearing words, or simply not knowing how to spell a word correctly. While spelling errors are usually minor, they can sometimes be embarrassing or even lead to misunderstandings. The Jaro-Winkler algorithm is one of the many ways that help to accomplish such with a satisfactory result of 0.87 in Mean Average Precision (MAP), and it is a string metric that measures the similarity between two strings. It is a…

How I Learned To Love Autonomous Generative AI Versus Auto-Complete

New generative AI for code tools help developers do their jobs more quickly with less effort by synthesizing code. ChatGPT and other LLM approaches have made this kind of generative AI accessible to a broad general audience. Developers are flocking to it. But too often overlooked are the fundamental differences between generative AI for code tools, and those differences inform which approach is better for your developer tasks. Under the covers, generative AI for code tools fall into two basic categories: transformer-based…

How to Build a Recommender System Using TensorFlow

What Is a Recommender System? A recommender system is a software engine developed to suggest products and services for a given set of customers. While there are multiple ways in which these systems recommend products, the most common is by analyzing a customer's previous purchasing patterns by storing data related to previous purchases, positive and negative reviews, saves/adds to lists, views, and more. So why do businesses such as Amazon and Netflix spend small fortunes building and improving these systems? Because…

Hugging Face Text Classification Tutorial Using PyTorch

What is PyTorch? PyTorch is a deep learning open-source TensorFlow library that is based on the well-known Torch library. It's also a Python-based library that is more commonly used for natural language processing and computer vision. In this tutorial, we will be using PyTorch to train our model for Text Classification. What Is Hugging Face? Hugging Face is an open-source dataset (website) provider which is used mainly for its natural language processing (NLP) datasets among others. It contains tons of valuable…

YOLOv5 PyTorch Tutorial – DZone

Using YOLOv5 in PyTorch YOLO, an acronym for 'You only look once,' is an open-source software tool utilized for its efficient capability of detecting objects in a given image in real time. The YOLO algorithm uses convolutional neural network (CNN) models to detect objects in an image.  The algorithm requires only one forward propagation through a given neural network to detect all objects in the image. This gives the YOLO algorithm an edge in speed over others, making it one of the most well-known detection…

Introduction to NVIDIA NeMo — Tutorial and Example

NVIDIA NeMo is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. So let's explain briefly what is the ASR, NLP, and TTS models. ASR, short for…

Data Set Programming in Machine Learning

The results achieved by advanced machine learning algorithms may seem mind-blowingly mysterious to outsiders, but careful data set programming makes them possible. They involve things like understanding how the finished algorithm would ideally work, sourcing appropriate information, and preparing it to remove errors. Here are some critical steps to take when creating a data set to program an effective machine learning algorithm.  1. Take Time to Understand and Define the Problem or Question People normally develop machine…

Time-series Forecasting With TensorFlow and QuestDB

Machine Learning For Time-series Forecasting Machine learning is taking the world by storm, performing many tasks with human-like accuracy. In the medical field, there are now smart assistants that can check your health over time. In finance, there are tools that can predict the return on your investment with a reasonable degree of accuracy. In online marketing, there are product recommenders that suggest specific products and brands based on your purchase history. In each of these fields, a different type of data can be…

What Is Text Classification? – DZone AI

What Is Text Classification? Text Classification is the process of categorizing text into one or more different classes to organize, structure, and filter into any parameter. For example, text classification is used in legal documents, medical studies, and files, or as simple as product reviews. Data is more important than ever; companies are spending fortunes trying to extract as many insights as possible. With text/document data being much more abundant than other data types, new methods of utilizing them are…