Techno Blender
Browsing Tag: large language model

Implement RAG Using Weaviate, LangChain4j, and LocalAI

In this blog, you will learn how to implement Retrieval-Augmented Generation (RAG) using Weaviate, LangChain4j, and LocalAI. This implementation allows you to ask questions about your documents using natural language. Enjoy! 1. Introduction: In the previous post, Weaviate was used as a vector database to perform a semantic search. The source documents are two Wikipedia articles: Bruce Springsteen's discography and the list of songs he has recorded. The interesting part of these documents is…

Graphs and Language – DZone

A rising tide lifts all boats, and the recent advances in LLMs are no exception. In this blog post, we will explore how Knowledge Graphs can benefit from LLMs, and vice versa. In particular, Knowledge Graphs can ground LLMs with facts using Graph RAG, which can be cheaper than Vector RAG. We'll look at a 10-line code example in LlamaIndex and see how easy it is to start. LLMs can help build automated KGs, which have been a bottleneck in the past. Graphs can provide your Domain Experts with an interface to supervise your…

Harmonizing AI: Crafting Personalized Song Suggestions

Hello, fellow machine learning enthusiasts! After several years of working as a Data Engineer, I've embarked on a new journey to delve into the diverse realms of machine learning. This article marks the beginning of my exploration through various projects aimed at learning and understanding this fascinating field. While I may be starting with Large Language Models (LLMs), I view this as an initial step to ignite my passion and motivation for this new endeavor. Join me as I dive into the world of machine learning, eager to…

LangChain, Python, and Heroku – DZone

Since the launch and wide adoption of ChatGPT near the end of 2022, we’ve seen a storm of news about tools, products, and innovations stemming from large language models (LLMs) and generative AI (GenAI). While many tech fads come and go within a few years, it’s clear that LLMs and GenAI are here to stay. Do you ever wonder about all the tooling going on in the background behind many of these new tools and products? In addition, you might even ask yourself how these tools (leveraged by both developers and end users) are run…

Mixtral: Generative Sparse Mixture of Experts in DataFlows

“The Mixtral-8x7B Large Language Model (LLM) is a pre-trained generative Sparse Mixture of Experts.” When I saw this come out, it seemed pretty interesting and accessible, so I gave it a try. With the proper prompting, it seems good. I am not sure if it’s better than Google Gemma, Meta Llama 2, or Ollama Mistral for my use cases. Today I will show you how to use the new Mixtral LLM with Apache NiFi. This will require only a few steps to run Mixtral against your text inputs. This model can be run by the lightweight…

Building Applications on Open Source LLMs

The computational complexity of AI models is growing exponentially, while the compute capability provided by hardware is growing linearly. Therefore, there is a growing gap between those two numbers, which can be seen as a supply and demand problem. On the demand side, we have everyone wanting to train or deploy an AI model. On the supply side, we have Nvidia and a number of competitors. Currently, the supply side is seeing earnings skyrocket, and the demand side is stockpiling and vying for access to compute. It's a…

Introduction to Modern AI 2024 Edition: Part 1

It's hard to believe it has been almost six years since I wrote my last article on Artificial Intelligence (AI), "Practical Artificial Intelligence." In that article, I gave an overview of the state of AI and Machine Learning (ML) and some popular usage and tools at the time. Since then, things have gotten crazy in the AI world: everyone is talking about tools like ChatGPT, but most people really don’t understand all the terminology and tools or what they are best suited for. In this article (part 1 of 2), I will attempt…

Microsoft Copilot Generates Demons When Prompted for Abortion Rights Images, Employee Says

Microsoft engineer Shane Jones is warning the world that his company’s AI image generator, Copilot Designer, needs to be removed from public use. In an interview with CNBC on Wednesday, Jones shared how the AI tool produces disturbing, strange images through basic prompts. He says Microsoft has largely ignored him, so he’s going public and asking government regulators to intervene. “This is really not a safe model,” Jones told CNBC. “Over the last three months, I have…

Anthropic says its new Claude 3 AI chatbot scores better on key benchmarks than GPT-4

The battle between AI chatbots is more than a two-horse race. Anthropic, the company formed by several ex-OpenAI employees, claims its new Claude 3 language model outperforms ChatGPT and Google's Gemini in several key industry benchmarks. It even hit "near-human" levels on some tasks, the company wrote in a blog. There are three new chatbots under the Claude 3 umbrella: Haiku, Sonnet, and Opus. Sonnet powers the Claude.ai chatbot and is offered for free with an email sign-in. Meanwhile, Opus is the largest and…

ChatGPT Training Data: Best Practices and Tips

Mastering ChatGPT training data: best practices and expert tips for enhanced performance. ChatGPT is a conversational AI agent that can generate natural and engaging text responses for various purposes, such as customer service, entertainment, education, and more. ChatGPT is based on GPT-4, a large language model that can learn from any text data and produce coherent and relevant texts on any topic. Preparing Your Data: The first step to training ChatGPT on your custom data is to prepare your data. This involves…