
Commit 3e7e37d

committed
fixed rag source format
1 parent 00e956a commit 3e7e37d

File tree

3 files changed: +36 −32 lines changed

  • content/posts/machine learning/deep learning/NLP/Gemma2+RAG
  • public


content/posts/machine learning/deep learning/NLP/Gemma2+RAG/index.md

+12 −9
@@ -89,7 +89,7 @@ import faiss
 ```
 
 
-## Data Loading
+## 4. Data Loading
 * Use *SimpleDirectoryReader* from llama_index.
 
 
@@ -98,7 +98,7 @@ import faiss
 documents = SimpleDirectoryReader('/kaggle/input/superconductivity-lectures/').load_data()
 ```
 
-## Load Embedding Model
+## 5. Load Embedding Model
 * It uses the "sentence-transformers/all-MiniLM-L6-v2" model to create vector representations of text.
 * This model is known for its efficiency in creating semantic embeddings.
 
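For reference, a minimal sketch of exercising the embedding model above, assuming the llama_index ≥ 0.10 package layout; `get_text_embedding` is the generic llama_index embedding call:

```python
from llama_index.embeddings.huggingface import HuggingFaceEmbedding

# Same model as in the post: maps text to 384-dimensional vectors,
# matching the dimension used for the FAISS index later on.
embed_model = HuggingFaceEmbedding(model_name="sentence-transformers/all-MiniLM-L6-v2")

vector = embed_model.get_text_embedding("What is a Cooper pair?")
print(len(vector))  # 384
```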
@@ -110,7 +110,7 @@ embed_model = HuggingFaceEmbedding(model_name="sentence-transformers/all-MiniLM-
 
 
 
-## 4. Language Model Setup and Loading
+## 6. Language Model Setup and Loading
 * It uses the "google/gemma-2-9b-it" model, a powerful instruction-tuned language model.
 * It configures 8-bit quantization to reduce memory usage
 * The tokenizer is set globally for consistency.
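This section's code is only partially visible in the diff; a sketch of the setup it describes, reconstructed from the rendered diff further down (the `BitsAndBytesConfig` usage, the `tokenizer_name` argument, and the global `Settings.tokenizer` line are assumptions based on the bullets above):

```python
from transformers import AutoTokenizer, BitsAndBytesConfig
from llama_index.core import Settings
from llama_index.llms.huggingface import HuggingFaceLLM

# 8-bit quantization to reduce memory usage (assumed BitsAndBytesConfig, per the bullet above)
quantization_config = BitsAndBytesConfig(load_in_8bit=True)

llm_model = HuggingFaceLLM(model_name="google/gemma-2-9b-it",
                           tokenizer_name="google/gemma-2-9b-it",
                           generate_kwargs={"temperature": 1, "num_return_sequences": 1, "do_sample": False},
                           model_kwargs={"quantization_config": quantization_config},
                           device_map='auto')

# Set the tokenizer globally for consistency (assumed Settings usage)
Settings.tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-9b-it")
```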
@@ -135,7 +135,7 @@ llm_model = HuggingFaceLLM(model_name="google/gemma-2-9b-it",
 
 
 
-## 5. Direct LLM Querying
+## 7. Direct LLM Querying
 This part demonstrates direct querying of the LLM:
 
 * It defines a list of queries about superconductivity.
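Only the first bullet of this section survives in the hunk; in outline, the direct-querying loop likely looks like the sketch below (the query list is abbreviated to the queries shown later in the post, and `complete` is the standard llama_index LLM call):

```python
queries = [
    "Which scientists contributed the most to superconductivity?",
    "What are the differences between Type-I and Type-II superconductors? Describe magnetic properties and show formulas.",
    "What are the London Equations? Why are they important?",
    # ...plus the flux-quantization problem quoted further down
]

# Query the LLM directly, with no retrieved context
direct_responses = [llm_model.complete(q) for q in queries]
for q, resp in zip(queries, direct_responses):
    print(q, resp.text, sep="\n\n")
```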
@@ -326,7 +326,7 @@ Let me know if you'd like me to calculate the numerical value of the magnetic fi
 
 
 
-## 6. Vector Store and Index Creation
+## 8. Vector Store and Index Creation
 This section sets up the vector store and creates the index:
 
 * It initializes a FAISS index with the embedding dimension of 384 (the same as the embedding model)
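Only `index.storage_context.persist()` is visible in the next hunk's context; a sketch of the full indexing step, assuming the standard llama_index FAISS wiring:

```python
import faiss
from llama_index.core import VectorStoreIndex, StorageContext
from llama_index.vector_stores.faiss import FaissVectorStore

# 384 = embedding dimension of all-MiniLM-L6-v2
faiss_index = faiss.IndexFlatL2(384)
vector_store = FaissVectorStore(faiss_index=faiss_index)
storage_context = StorageContext.from_defaults(vector_store=vector_store)

# Embed the documents and build the index on top of the FAISS store
index = VectorStoreIndex.from_documents(documents,
                                        storage_context=storage_context,
                                        embed_model=embed_model)

# Save the vector store locally
index.storage_context.persist()
```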
@@ -353,7 +353,7 @@ index.storage_context.persist()
 
 
 
-## 7. RAG Querying
+## 9. RAG Querying
 * Compare these results with the previous Direct LLM queries
 * The default *similarity_top_k* value is 3. However, I set it to 5 to get more exhaustive answers.
 * We expect more accurate and truthful answers, yet the answers about the London Equations are still wrong. Also, for the first query the direct LLM names only a few scientists and never mentions "Josephson" (even after multiple generations).
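The querying loop itself appears in the rendered diff below; in outline it amounts to the following (passing the LLM explicitly to `as_query_engine` is an assumption):

```python
# Retrieve the 5 most similar chunks instead of the default 3
query_engine = index.as_query_engine(llm=llm_model, similarity_top_k=5)

rag_responses = [query_engine.query(q) for q in queries]
for i, resp in enumerate(rag_responses):
    # Record which lecture PDFs each answer was grounded in
    sources = [node.metadata['file_name'] for node in resp.source_nodes]
    print(queries[i], f"Sources: {sources}", resp.response, sep="\n")
```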
@@ -385,6 +385,7 @@ for i, resp in enumerate(rag_responses):
 
 
 <span style="font-size:1.5em;font-weight:700"> Which scientists contributed the most to superconductivity? </span>
+
 Sources: _['Lecture1.pdf', 'Lecture1.pdf', 'Lecture1.pdf', 'Lecture1.pdf', 'Lecture1.pdf']_
 
 
@@ -411,6 +412,7 @@ The text emphasizes the importance of understanding the microscopic mechanism of
 
 
 <span style="font-size:1.5em;font-weight:700"> What are the differences between Type-I and Type-II superconductors? Describe magnetic properties and show formulas. </span>
+
 Sources: _['Lecture2.pdf', 'Lecture2.pdf', 'Lecture1.pdf', 'Lecture3.pdf', 'Lecture1.pdf']_
 
 
@@ -459,6 +461,7 @@ Let me know if you have any other questions.
 
 
 <span style="font-size:1.5em;font-weight:700"> What are the London Equations? Why are they important? </span>
+
 Sources: _['Lecture1.pdf', 'Lecture3.pdf', 'Lecture3.pdf', 'Lecture3.pdf', 'Lecture1.pdf']_
 
 
@@ -497,8 +500,8 @@ The provided text highlights the historical development of superconductivity the
 
 
 
-
 <span style="font-size:1.5em;font-weight:700"> Solve this problem: Consider a bulk superconductor containing a cylindrical hole of 0.1 mm diameter. There are 7 magnetic flux quanta trapped in the hole. Find the magnetic field in the hole.</span>
+
 Sources: _['Lecture3.pdf', 'Lecture3.pdf', 'Lecture3.pdf', 'Lecture3.pdf', 'Lecture3.pdf']_
 
 
@@ -540,9 +543,9 @@ Substitute the values of Φ0, d, and π into the equation to obtain the numerica
 
 Let me know if you have any further questions.
 
+---
 
-
-## 8. Conclusion
+## 10. Conclusion
 This implementation demonstrates the power of RAG in combining the strengths of large language models with the ability to retrieve and utilize specific, relevant information. By using FAISS for efficient similarity search and a state-of-the-art language model like Gemma-2-9b, this system can provide informed, context-aware responses to complex queries about superconductivity.
 The comparison between direct LLM responses and RAG responses would likely show the benefits of RAG in providing more detailed, accurate, and source-backed information. This approach is particularly valuable in domains requiring up-to-date or specialized knowledge, where the LLM's pre-trained knowledge might be insufficient or outdated.
 
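As a quick numeric check of the flux-quantization answer quoted above: with $n = 7$, $\Phi_0 = h/2e \approx 2.07 \times 10^{-15}\,\mathrm{Wb}$, and hole diameter $d = 0.1\,\mathrm{mm}$,

$$B = \frac{n\,\Phi_0}{\pi (d/2)^2} = \frac{7 \times 2.07 \times 10^{-15}\,\mathrm{Wb}}{\pi \,(5 \times 10^{-5}\,\mathrm{m})^2} \approx 1.8 \times 10^{-6}\,\mathrm{T} \approx 1.8\,\mathrm{\mu T}.$$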

public/index.json

+1 −1
Large diffs are not rendered by default.

public/posts/machine-learning/deep-learning/nlp/gemma2+rag/index.html

+23 −22
@@ -575,20 +575,20 @@ <h2 id="2-setup-and-import">2. Setup and Import</h2>
 </span></span><span style="display:flex;"><span>
 </span></span><span style="display:flex;"><span><span style="color:#f92672">from</span> llama_index.vector_stores.faiss <span style="color:#f92672">import</span> FaissVectorStore
 </span></span><span style="display:flex;"><span><span style="color:#f92672">import</span> faiss
-</span></span></code></pre></div><h2 id="data-loading">Data Loading</h2>
+</span></span></code></pre></div><h2 id="4-data-loading">4. Data Loading</h2>
 <ul>
 <li>Use <em>SimpleDirectoryReader</em> from llama_index.</li>
 </ul>
 <div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-python" data-lang="python"><span style="display:flex;"><span><span style="color:#75715e"># Load the PDF</span>
 </span></span><span style="display:flex;"><span>documents <span style="color:#f92672">=</span> SimpleDirectoryReader(<span style="color:#e6db74">&#39;/kaggle/input/superconductivity-lectures/&#39;</span>)<span style="color:#f92672">.</span>load_data()
-</span></span></code></pre></div><h2 id="load-embedding-model">Load Embedding Model</h2>
+</span></span></code></pre></div><h2 id="5-load-embedding-model">5. Load Embedding Model</h2>
 <ul>
 <li>It uses the &ldquo;sentence-transformers/all-MiniLM-L6-v2&rdquo; model to create vector representations of text.</li>
 <li>This model is known for its efficiency in creating semantic embeddings.</li>
 </ul>
 <div class="highlight"><pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;"><code class="language-python" data-lang="python"><span style="display:flex;"><span><span style="color:#75715e"># Load embedding model</span>
 </span></span><span style="display:flex;"><span>embed_model <span style="color:#f92672">=</span> HuggingFaceEmbedding(model_name<span style="color:#f92672">=</span><span style="color:#e6db74">&#34;sentence-transformers/all-MiniLM-L6-v2&#34;</span>)
-</span></span></code></pre></div><h2 id="4-language-model-setup-and-loading">4. Language Model Setup and Loading</h2>
+</span></span></code></pre></div><h2 id="6-language-model-setup-and-loading">6. Language Model Setup and Loading</h2>
 <ul>
 <li>It uses the &ldquo;google/gemma-2-9b-it&rdquo; model, a powerful instruction-tuned language model.</li>
 <li>It configures 8-bit quantization to reduce memory usage</li>
@@ -608,7 +608,7 @@ <h2 id="2-setup-and-import">2. Setup and Import</h2>
 </span></span><span style="display:flex;"><span> generate_kwargs<span style="color:#f92672">=</span>{<span style="color:#e6db74">&#34;temperature&#34;</span>: <span style="color:#ae81ff">1</span>, <span style="color:#e6db74">&#34;num_return_sequences&#34;</span>:<span style="color:#ae81ff">1</span>, <span style="color:#e6db74">&#34;do_sample&#34;</span>: <span style="color:#66d9ef">False</span>},
 </span></span><span style="display:flex;"><span> model_kwargs<span style="color:#f92672">=</span>{<span style="color:#e6db74">&#34;quantization_config&#34;</span>: quantization_config},
 </span></span><span style="display:flex;"><span> device_map<span style="color:#f92672">=</span><span style="color:#e6db74">&#39;auto&#39;</span>)
-</span></span></code></pre></div><h2 id="5-direct-llm-querying">5. Direct LLM Querying</h2>
+</span></span></code></pre></div><h2 id="7-direct-llm-querying">7. Direct LLM Querying</h2>
 <p>This part demonstrates direct querying of the LLM:</p>
 <ul>
 <li>It defines a list of queries about superconductivity.</li>
@@ -788,7 +788,7 @@ <h2 id="2-setup-and-import">2. Setup and Import</h2>
 </ul>
 <p>Calculate the magnetic field (B) using these values.</p>
 <p>Let me know if you&rsquo;d like me to calculate the numerical value of the magnetic field.</p>
-<h2 id="6-vector-store-and-index-creation">6. Vector Store and Index Creation</h2>
+<h2 id="8-vector-store-and-index-creation">8. Vector Store and Index Creation</h2>
 <p>This section sets up the vector store and creates the index:</p>
 <ul>
 <li>It initializes a FAISS index with the embedding dimension of 384 (the same as the embedding model)</li>
@@ -807,7 +807,7 @@ <h2 id="6-vector-store-and-index-creation">6. Vector Store and Index Creation</h
 </span></span><span style="display:flex;"><span>
 </span></span><span style="display:flex;"><span><span style="color:#75715e"># save the vector store locally</span>
 </span></span><span style="display:flex;"><span>index<span style="color:#f92672">.</span>storage_context<span style="color:#f92672">.</span>persist()
-</span></span></code></pre></div><h2 id="7-rag-querying">7. RAG Querying</h2>
+</span></span></code></pre></div><h2 id="9-rag-querying">9. RAG Querying</h2>
 <ul>
 <li>Compare these results with the previous Direct LLM queries</li>
 <li>The default <em>similarity_top_k</em> value is 3. However, I set it to 5 to get more exhaustive answers.</li>
@@ -825,8 +825,8 @@ <h2 id="6-vector-store-and-index-creation">6. Vector Store and Index Creation</h
 </span></span><span style="display:flex;"><span> <span style="color:#66d9ef">for</span> node <span style="color:#f92672">in</span> resp<span style="color:#f92672">.</span>source_nodes:
 </span></span><span style="display:flex;"><span> sources<span style="color:#f92672">.</span>append(node<span style="color:#f92672">.</span>metadata[<span style="color:#e6db74">&#39;file_name&#39;</span>])
 </span></span><span style="display:flex;"><span> display(Markdown(<span style="color:#e6db74">&#34;## &#34;</span> <span style="color:#f92672">+</span> queries[i] <span style="color:#f92672">+</span> <span style="color:#e6db74">&#34;</span><span style="color:#ae81ff">\n</span><span style="color:#e6db74">&#34;</span> <span style="color:#f92672">+</span> <span style="color:#e6db74">f</span><span style="color:#e6db74">&#34;Sources: _</span><span style="color:#e6db74">{</span>sources<span style="color:#e6db74">}</span><span style="color:#e6db74">_</span><span style="color:#ae81ff">\n</span><span style="color:#e6db74">&#34;</span> <span style="color:#f92672">+</span> resp<span style="color:#f92672">.</span>response))
-</span></span></code></pre></div><p><span style="font-size:1.5em;font-weight:700"> Which scientists contributed the most to superconductivity? </span>
-Sources: <em>[&lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;]</em></p>
+</span></span></code></pre></div><p><span style="font-size:1.5em;font-weight:700"> Which scientists contributed the most to superconductivity? </span></p>
+<p>Sources: <em>[&lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;]</em></p>
 <p>Based on the provided text, the scientists who contributed the most to superconductivity are:</p>
 <ul>
 <li><strong>Heike Kamerlingh Onnes:</strong> Discovered superconductivity.</li>
@@ -841,8 +841,8 @@ <h2 id="6-vector-store-and-index-creation">6. Vector Store and Index Creation</h
 <li><strong>John Cooper:</strong> His work on electron pairing in superconductors was crucial for the development of the BCS theory.</li>
 </ul>
 <p>The text emphasizes the importance of understanding the microscopic mechanism of superconductivity, highlighting the contributions of Cooper and the development of the BCS theory. It also provides some insights into why certain materials, like noble metals, do not become superconductors.</p>
-<p><span style="font-size:1.5em;font-weight:700"> What are the differences between Type-I and Type-II superconductors? Describe magnetic properties and show formulas. </span>
-Sources: <em>[&lsquo;Lecture2.pdf&rsquo;, &lsquo;Lecture2.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;]</em></p>
+<p><span style="font-size:1.5em;font-weight:700"> What are the differences between Type-I and Type-II superconductors? Describe magnetic properties and show formulas. </span></p>
+<p>Sources: <em>[&lsquo;Lecture2.pdf&rsquo;, &lsquo;Lecture2.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;]</em></p>
 <p>Superconductors can be divided into two groups, Type-I and Type-II, characterized by their different responses to external magnetic fields. This classification is crucial in understanding the behavior of superconductors in various applications.</p>
 <p><strong>Type-I Superconductors:</strong></p>
 <ul>
@@ -893,8 +893,8 @@ <h2 id="6-vector-store-and-index-creation">6. Vector Store and Index Creation</h
 </li>
 </ul>
 <p>Let me know if you have any other questions.</p>
-<p><span style="font-size:1.5em;font-weight:700"> What are the London Equations? Why are they important? </span>
-Sources: <em>[&lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;]</em></p>
+<p><span style="font-size:1.5em;font-weight:700"> What are the London Equations? Why are they important? </span></p>
+<p>Sources: <em>[&lsquo;Lecture1.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture1.pdf&rsquo;]</em></p>
 <p>The London equations are a set of two fundamental equations that describe the behavior of superconductors in electromagnetic fields. They are:</p>
 <ul>
 <li><strong>Equation (3.6):</strong> ∇²H = - (4π/λ²)J<sub>s</sub></li>
@@ -930,8 +930,8 @@ <h2 id="6-vector-store-and-index-creation">6. Vector Store and Index Creation</h
 </ul>
 <p><strong>Contextual Connection:</strong></p>
 <p>The provided text highlights the historical development of superconductivity theory, culminating in the BCS theory. The London equations, while a simplified model, played a crucial role in laying the groundwork for these later, more sophisticated theories. They provided the first concrete explanation for the phenomenon of perfect diamagnetism and zero resistance, paving the way for a deeper understanding of superconductivity.</p>
-<p><span style="font-size:1.5em;font-weight:700"> Solve this problem: Consider a bulk superconductor containing a cylindrical hole of 0.1 mm diameter. There are 7 magnetic flux quanta trapped in the hole. Find the magnetic field in the hole.</span>
-Sources: <em>[&lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;]</em></p>
+<p><span style="font-size:1.5em;font-weight:700"> Solve this problem: Consider a bulk superconductor containing a cylindrical hole of 0.1 mm diameter. There are 7 magnetic flux quanta trapped in the hole. Find the magnetic field in the hole.</span></p>
+<p>Sources: <em>[&lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;, &lsquo;Lecture3.pdf&rsquo;]</em></p>
 <p>To solve this problem, we can use the concept of magnetic flux quantization in superconductors.</p>
 <p><strong>1. Magnetic Flux Quantization:</strong></p>
 <p>Each flux quantum (Φ0) is given by:</p>
@@ -954,7 +954,8 @@ <h2 id="6-vector-store-and-index-creation">6. Vector Store and Index Creation</h
 <p><strong>3. Numerical Calculation:</strong></p>
 <p>Substitute the values of Φ0, d, and π into the equation to obtain the numerical value of the magnetic field in the hole.</p>
 <p>Let me know if you have any further questions.</p>
-<h2 id="8-conclusion">8. Conclusion</h2>
+<hr>
+<h2 id="10-conclusion">10. Conclusion</h2>
 <p>This implementation demonstrates the power of RAG in combining the strengths of large language models with the ability to retrieve and utilize specific, relevant information. By using FAISS for efficient similarity search and a state-of-the-art language model like Gemma-2-9b, this system can provide informed, context-aware responses to complex queries about superconductivity.
 The comparison between direct LLM responses and RAG responses would likely show the benefits of RAG in providing more detailed, accurate, and source-backed information. This approach is particularly valuable in domains requiring up-to-date or specialized knowledge, where the LLM&rsquo;s pre-trained knowledge might be insufficient or outdated.</p>
 
@@ -1121,13 +1122,13 @@ <h5 class="text-center ps-3">Table of Contents</h5>
 <li><a href="#1-introduction">1. Introduction</a></li>
 <li><a href="#2-setup-and-import">2. Setup and Import</a></li>
 <li><a href="#3-model-and-vectordb-imports">3. Model and VectorDB imports</a></li>
-<li><a href="#data-loading">Data Loading</a></li>
-<li><a href="#load-embedding-model">Load Embedding Model</a></li>
-<li><a href="#4-language-model-setup-and-loading">4. Language Model Setup and Loading</a></li>
-<li><a href="#5-direct-llm-querying">5. Direct LLM Querying</a></li>
-<li><a href="#6-vector-store-and-index-creation">6. Vector Store and Index Creation</a></li>
-<li><a href="#7-rag-querying">7. RAG Querying</a></li>
-<li><a href="#8-conclusion">8. Conclusion</a></li>
+<li><a href="#4-data-loading">4. Data Loading</a></li>
+<li><a href="#5-load-embedding-model">5. Load Embedding Model</a></li>
+<li><a href="#6-language-model-setup-and-loading">6. Language Model Setup and Loading</a></li>
+<li><a href="#7-direct-llm-querying">7. Direct LLM Querying</a></li>
+<li><a href="#8-vector-store-and-index-creation">8. Vector Store and Index Creation</a></li>
+<li><a href="#9-rag-querying">9. RAG Querying</a></li>
+<li><a href="#10-conclusion">10. Conclusion</a></li>
 </ul>
 </nav>
 </div>
