[Day 190] Learning about evaluating vector search engines for RAG apps

Hello :)
Today is Day 190!

All the code from today is on my repo.

The first part of the module was about semantic search using dense vectors in Elasticsearch:

1. Loaded Q&A documents from a json file

2. Created dense vectors for each document using a pre-trained model

3. Created an index in Elasticsearch

4. Indexed the documents in Elasticsearch

5. Performed a semantic search using the dense vectors

6. Filtered the results using a specific section
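Steps 3 to 6 can be sketched as plain request bodies. This is only a minimal sketch, not the course code: the field names (`text`, `section`, `text_vector`), the vector size of 768, and the index name `course-questions` are assumptions for illustration. The actual client calls are left as comments because they need a running Elasticsearch cluster.

```python
# Hypothetical vector size; it must match the embedding model actually used.
VECTOR_DIMS = 768


def build_index_body(dims=VECTOR_DIMS):
    """Settings and mappings for an index with a dense_vector field (assumed field names)."""
    return {
        "settings": {"number_of_shards": 1, "number_of_replicas": 0},
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                "section": {"type": "keyword"},
                "text_vector": {
                    "type": "dense_vector",
                    "dims": dims,
                    "index": True,
                    "similarity": "cosine",
                },
            }
        },
    }


def build_knn_query(query_vector, section=None, k=5, num_candidates=100):
    """kNN clause for a semantic search, optionally filtered to one section."""
    knn = {
        "field": "text_vector",
        "query_vector": list(query_vector),
        "k": k,
        "num_candidates": num_candidates,
    }
    if section is not None:
        # Only documents whose "section" keyword matches exactly are considered.
        knn["filter"] = {"term": {"section": section}}
    return knn


# With the official Python client (8.x) and a running cluster, roughly:
#   es.indices.create(index="course-questions", **build_index_body())
#   es.index(index="course-questions", document=doc_with_vector)
#   es.search(index="course-questions", knn=build_knn_query(vec, section="..."))
```

Keeping the filter inside the kNN clause means the section restriction is applied while searching for neighbours, not after the top results are already picked.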

(picture is from the course)

Next, I learned about evaluating the retrieval mechanism:

1. Generate unique IDs for each document to tell them apart

2. Generate 5 sample questions for each document using the GPT API

3. Save the results to a file to use for evaluation
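The three steps above can be sketched roughly like this. It is a sketch under assumptions: the document fields (`section`, `question`, `text`), the MD5-based ID, and the CSV columns are my illustration, and `generate_questions` is a hypothetical stand-in for the GPT API call, passed in as a function so the rest can run without an API key.

```python
import csv
import hashlib


def make_doc_id(doc):
    """Stable short ID derived from the document's content (assumed fields)."""
    key = f"{doc['section']}-{doc['question']}-{doc['text'][:10]}"
    return hashlib.md5(key.encode("utf-8")).hexdigest()[:8]


def build_ground_truth(docs, generate_questions):
    """Return one row per generated question, linked to its source document ID.

    generate_questions is a hypothetical callable: doc -> list of question strings.
    """
    rows = []
    for doc in docs:
        doc_id = make_doc_id(doc)
        for question in generate_questions(doc):
            rows.append({"question": question, "document": doc_id})
    return rows


def save_rows(rows, path):
    """Write the question/document pairs to a CSV file for later evaluation."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["question", "document"])
        writer.writeheader()
        writer.writerows(rows)
```

Because the ID is a hash of the content, re-running the script gives the same IDs. Two documents with the same key would collide, so it is worth checking for duplicates before relying on it.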

First 10 rows from the created dataset:

The ID is needed to connect the generated sample questions to the documents they are related to.

Next, I learned about two metrics for evaluating the search mechanism:

Recall

Measures the fraction of relevant documents that were retrieved, out of all relevant documents available.
Formula: Recall = (Number of relevant documents retrieved) / (Total number of relevant documents)
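The recall formula translates almost directly into code. A minimal sketch for a single query, with the function and argument names being my own:

```python
def recall(retrieved_ids, relevant_ids):
    """Fraction of the relevant documents that appear among the retrieved ones."""
    relevant = set(relevant_ids)
    if not relevant:
        raise ValueError("relevant_ids must not be empty")
    return len(relevant & set(retrieved_ids)) / len(relevant)


# Two documents are relevant, one of them ("a") was retrieved.
print(recall(["a", "b", "c"], ["a", "d"]))  # 0.5
```

In the dataset described above, each generated question is linked to exactly one document, so recall per question is either 0 or 1, and the average over all questions is the share of questions whose document was found.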

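Mean Reciprocal Rank

Measures how high the first relevant document ranks in the results, averaged over all queries.
Formula: MRR = (1 / Number of queries) * Sum over queries of (1 / rank of the first relevant document), counting 0 when no relevant document is retrieved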
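A minimal sketch of MRR, assuming one expected document per query: each query contributes 1 divided by the position of its expected document in the ranked results, or 0 if the document is missing. The function and argument names are my own.

```python
def mean_reciprocal_rank(results, expected):
    """results: list of ranked ID lists, one per query; expected: the relevant ID per query."""
    if len(results) != len(expected):
        raise ValueError("results and expected must have the same length")
    if not results:
        raise ValueError("at least one query is required")
    total = 0.0
    for ranked_ids, doc_id in zip(results, expected):
        if doc_id in ranked_ids:
            # Ranks are 1-based: the first position contributes 1, the second 1/2, ...
            total += 1.0 / (ranked_ids.index(doc_id) + 1)
    return total / len(results)


# Ranks are 1, 2 and "not found", so MRR = (1 + 0.5 + 0) / 3.
print(mean_reciprocal_rank([["a", "b"], ["c", "d"], ["e", "f"]], ["a", "d", "z"]))  # 0.5
```

Unlike recall, MRR rewards engines that put the right document near the top, not just somewhere in the results.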

To do the evaluation, a few different search engines were created and compared on recall and MRR.

Each engine took a different amount of time to run, so choosing between them comes down to what we care about more: speed or a given % improvement in the metrics.
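One way to put quality and speed side by side is a small harness that runs every question through an engine and times the whole run. This is a sketch under assumptions: `search_fn` is a hypothetical function that takes a question and returns a ranked list of document IDs, and the ground truth rows use assumed `question` and `document` keys.

```python
import time


def evaluate(ground_truth, search_fn):
    """Run every question through search_fn and report hit rate, MRR and elapsed seconds."""
    if not ground_truth:
        raise ValueError("ground_truth must not be empty")
    ranks = []
    start = time.perf_counter()
    for row in ground_truth:
        ids = search_fn(row["question"])
        # 1-based rank of the expected document, or None if it was not returned.
        ranks.append(ids.index(row["document"]) + 1 if row["document"] in ids else None)
    elapsed = time.perf_counter() - start
    n = len(ranks)
    return {
        "hit_rate": sum(r is not None for r in ranks) / n,
        "mrr": sum(1.0 / r for r in ranks if r is not None) / n,
        "seconds": elapsed,
    }


# Toy example: an "engine" that always returns the same ranking.
truth = [{"question": "q1", "document": "d1"}, {"question": "q2", "document": "d2"}]
report = evaluate(truth, lambda question: ["d1", "d2"])
print(report["hit_rate"], report["mrr"])  # 1.0 0.75
```

Running the same harness over each engine gives comparable numbers, which makes it easier to judge whether a small gain in the metrics is worth a slower search.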

At the end, I completed the homework, which covered questions similar to the content above.

That is all for today!

See you tomorrow :)