Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,14 +1,42 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
-
|
| 3 |
-
|
| 4 |
-
|
| 5 |
-
|
| 6 |
-
sdk: gradio
|
| 7 |
-
sdk_version: 5.49.0
|
| 8 |
-
app_file: app.py
|
| 9 |
-
pinned: false
|
| 10 |
-
license: mit
|
| 11 |
-
short_description: Multilingual YouTube chatbot powered by LangChain.
|
| 12 |
---
|
| 13 |
|
| 14 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# ๐ง YouTube Transcript Chatbot (RAG) โ LLaMA, Gemma, DeepSeek, OpenAI
|
| 2 |
+
|
| 3 |
+
An advanced **Retrieval-Augmented Generation (RAG)** chatbot that answers questions from YouTube video transcripts using top open-source LLMs โ **Llama 3.2**, **Gemma 2**, **DeepSeek**, and **OpenAI GPT-OSS**.
|
| 4 |
+
|
| 5 |
+
Built with **Streamlit**, **LangChain**, **Hugging Face Hub**, and **FAISS**.
|
| 6 |
+
|
| 7 |
+
---
|
| 8 |
+
|
| 9 |
+
## ๐ Features
|
| 10 |
+
|
| 11 |
+
โ
Fetch YouTube transcripts via **RapidAPI**
|
| 12 |
+
โ
Embed and store transcript chunks using **FAISS**
|
| 13 |
+
โ
Query understanding through **LangChain retrieval pipeline**
|
| 14 |
+
โ
Supports **multiple Hugging Face chat-based models**:
|
| 15 |
+
- ๐ฆ **Llama-3.2-1B-Instruct**
|
| 16 |
+
- ๐ **Gemma-2-2B-it**
|
| 17 |
+
- ๐ง **DeepSeek-V3.2-Exp**
|
| 18 |
+
- ๐ฎ **OpenAI GPT-OSS-20B**
|
| 19 |
+
|
| 20 |
+
โ
Adjustable **temperature** for creative or factual outputs
|
| 21 |
+
โ
Seamless **Streamlit UI**
|
| 22 |
+
โ
Caching for faster repeat queries
|
| 23 |
+
|
| 24 |
---
|
| 25 |
+
|
| 26 |
+
## ๐งฉ Architecture
|
| 27 |
+
|
| 28 |
+
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 |
---
|
| 30 |
|
| 31 |
+
## ๐ง Tech Stack
|
| 32 |
+
|
| 33 |
+
| Component | Technology |
|
| 34 |
+
|------------|-------------|
|
| 35 |
+
| Frontend | Streamlit |
|
| 36 |
+
| Backend | LangChain |
|
| 37 |
+
| LLMs | Hugging Face Endpoints |
|
| 38 |
+
| Embeddings | intfloat/multilingual-e5-base |
|
| 39 |
+
| Vector Store | FAISS |
|
| 40 |
+
| API | YouTube Transcript API (RapidAPI) |
|
| 41 |
+
|
| 42 |
+
---
|