Unlocking Data with Generative AI and RAG

By : Keith Bourne

5 (2)

Buy this Book

Unlocking Data with Generative AI and RAG

5 (2)

By: Keith Bourne

Buy this Book

Overview of this book

Generative AI is helping organizations tap into their data in new ways, with retrieval-augmented generation (RAG) combining the strengths of large language models (LLMs) with internal data for more intelligent and relevant AI applications. The author harnesses his decade of ML experience in this book to equip you with the strategic insights and technical expertise needed when using RAG to drive transformative outcomes. The book explores RAG’s role in enhancing organizational operations by blending theoretical foundations with practical techniques. You’ll work with detailed coding examples using tools such as LangChain and Chroma’s vector database to gain hands-on experience in integrating RAG into AI systems. The chapters contain real-world case studies and sample applications that highlight RAG’s diverse use cases, from search engines to chatbots. You’ll learn proven methods for managing vector databases, optimizing data retrieval, effective prompt engineering, and quantitatively evaluating performance. The book also takes you through advanced integrations of RAG with cutting-edge AI agents and emerging non-LLM technologies. By the end of this book, you’ll be able to successfully deploy RAG in business settings, address common challenges, and push the boundaries of what’s possible with this revolutionary AI technique.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Share Your Thoughts

Download a free PDF copy of this book

Free Chapter

Part 1 – Introduction to Retrieval-Augmented Generation (RAG)

Chapter 1: What Is Retrieval-Augmented Generation (RAG)

Understanding RAG – Basics and principles

RAG vocabulary

Vectors

Implementing RAG in AI applications

Comparing RAG with conventional generative AI

Comparing RAG with model fine-tuning

The architecture of RAG systems

Summary

Chapter 2: Code Lab – An Entire RAG Pipeline

Technical requirements

No interface!

Setting up a large language model (LLM) account

Installing the necessary packages

Indexing

Retrieval and generation

Submitting a question for RAG

Final output

Complete code

Summary

Chapter 3: Practical Applications of RAG

Technical requirements

Customer support and chatbots with RAG

RAG for automated reporting

E-commerce support

Utilizing knowledge bases with RAG

Innovation scouting and trend analysis

Leveraging RAG for personalized recommendations in marketing communications

Training and education

Code lab 3.1 – Adding sources to your RAG

Summary

Chapter 4: Components of a RAG System

Technical requirements

Key component overview

Indexing

Retrieval and generation

Prompting

Defining your LLM

Evaluation

Summary

References

Chapter 5: Managing Security in RAG Applications

Technical requirements

How RAG can be leveraged as a security solution

RAG security challenges

Red teaming

Common areas to target with red teaming

Code lab 5.1 – Securing your keys

Code lab 5.2 – Red team attack!

Code lab 5.3 – Blue team defend!

Summary

Part 2 – Components of RAG

Chapter 6: Interfacing with RAG and Gradio

Technical requirements

Why Gradio?

Benefits of using Gradio

Limitations to using Gradio

Code lab – Adding a Gradio interface

Summary

Chapter 7: The Key Role Vectors and Vector Stores Play in RAG

Technical requirements

Fundamentals of vectors in RAG

Where vectors lurk in your code

The amount of text you vectorize matters!

Not all semantics are created equal!

Code lab 7.1 – Common vectorization techniques

Factors in selecting a vectorization option

Getting started with vector stores

Choosing a vector store

Summary

Chapter 8: Similarity Searching with Vectors

Technical requirements

Distance metrics versus similarity algorithms versus vector search

Vector space

Semantic versus keyword search

Code lab 8.1 – Semantic distance metrics

Different search paradigms – sparse, dense, and hybrid

Code lab 8.2 – Hybrid search with a custom function

Code lab 8.3 – Hybrid search with LangChain’s EnsembleRetriever to replace our custom function

Semantic search algorithms

Enhancing search with indexing techniques

Vector search options

Summary

Chapter 9: Evaluating RAG Quantitatively and with Visualizations

Technical requirements

Evaluate as you build

Evaluate after you deploy

Evaluation helps you get better

Standardized evaluation frameworks

What is the ground truth?

Code lab 9.1 – ragas

End-to-end evaluation

Other component-wise evaluation

Additional evaluation techniques

Summary

References

Chapter 10: Key RAG Components in LangChain

Technical requirements

Code lab 10.1 – LangChain vector store

Code lab 10.2 – LangChain Retrievers

Code lab 10.3 – LangChain LLMs

Summary

Chapter 11: Using LangChain to Get More from RAG

Technical requirements

Code lab 11.1 – Document loaders

Code lab 11.2 – Text splitters

Code lab 11.3 – Output parsers

Summary

Part 3 – Implementing Advanced RAG

Chapter 12: Combining RAG with the Power of AI Agents and LangGraph

Technical requirements

Fundamentals of AI agents and RAG integration

Graphs, AI agents, and LangGraph

Code lab 12.1 – adding a LangGraph agent to RAG

Core concepts of graph theory

Nodes and edges in our agent

Cyclical graph setup

Summary

Chapter 13: Using Prompt Engineering to Improve RAG Efforts

Technical requirements

Prompt parameters

Take your shot

Prompting, prompt design, and prompt engineering revisited

Prompt design versus engineering approaches

Fundamentals of prompt design

Adapting prompts for different LLMs

Code lab 13.1 – Custom prompt template

Code lab 13.2 – Prompting options

Summary

Chapter 14: Advanced RAG-Related Techniques for Improving Results

Technical requirements

Naïve RAG and its limitations

Hybrid RAG/multi-vector RAG for improved retrieval

Re-ranking in hybrid RAG

Code lab 14.1 – Query expansion

Code lab 14.2 – Query decomposition

Code lab 14.3 – MM-RAG

Other advanced RAG techniques to explore

Summary

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Customer Reviews

5 (2)

5 star

100%

4 star

3 star

2 star

1 star

Unlocking Data with Generative AI and RAG

By : Keith Bourne

Unlocking Data with Generative AI and RAG

By: Keith Bourne

Overview of this book

The architecture of RAG systems

Delete Bookmark