Posts

Showing posts from March 21, 2024

[Day 79] Attempting to make a Local Retrieval Augmented Generation (RAG) from Scratch

Image
 Hello :) Today is Day 79! A quick summary of today: Started this tutorial for building a local RAG from scratch and dealt with issues due to me being on a mac A 1 line summary of the tutorial: wow!!! amazing!!! haha The tutorial is focused on cuda, but instead of colab, I wanted to try to do this locally: 1. to see the limits of my m2 pro 2. to deal with issues not shown in the video I am almost at the end - 4:49:34 / 5:40:58, but it is 2am and I am falling asleep on the keyboard even writing this post now.  The tutorial is pretty amazing, it follows the below graph The pdf used for the tutorial is a 1200 page book about nutrition, and the goal is to create a model with which we can talk and ask questions about nutrition. My thoughts of using this are afterwards I would use research papers, and test it with them.  The steps to success are: Open a PDF doc (or collection of PDFs) Format the text, ready for an embedding model Embed the chunks of text and turn into embeddings Build a ret