summerstars commited on
Commit
517258d
·
verified ·
1 Parent(s): 9747bfe

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -0
README.md ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model:
4
+ - HuggingFaceTB/SmolLM2-360M-Instruct
5
+ language:
6
+ - en
7
+ pipeline_tag: text-generation
8
+ tags:
9
+ - safetensors
10
+ - onnx
11
+ - transformers
12
+ ---
13
+
14
+ # 🌞 SolaraV2 — `summerstars/SolaraV2`
15
+
16
+ ## ✨ Created by a High School Student | Built on Google Colab (T4 GPU)
17
+ ### 🌸 高校生によって開発 | Google Colab(T4 GPU)で作成
18
+
19
+ **SolaraV2** is an upgraded version of the original **Solara** — a lightweight, instruction-tuned language model based on [`HuggingFaceTB/SmolLM2-360M-Instruct`](https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct).
20
+ This version is trained on a **larger and more diverse dataset**, including **basic math-related samples**, improving its ability to handle both casual conversations and educational tasks.
21
+ All development was conducted by a high school student using **Google Colab** and a **T4 GPU**.
22
+
23
+ **SolaraV2(ソララV2)** は、オリジナルの **Solara** モデルを改良した軽量の言語モデルで、[`HuggingFaceTB/SmolLM2-360M-Instruct`](https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct) をベースにしています。
24
+ 本バージョンでは、**より大規模かつ多様なデータセット**(数学系データを含む)で学習を行い、日常会話から教育的な質問まで幅広く対応できるようになりました。
25
+ 開発はすべて、高校生が **Google Colab(T4 GPU)** 上で行いました。
26
+
27
+ ---
28
+
29
+ ## 📌 Model Details | モデル詳細
30
+
31
+ | Feature / 特徴 | Description / 説明 |
32
+ |--------------------|------------------|
33
+ | **Base Model** | `HuggingFaceTB/SmolLM2-360M-Instruct` |
34
+ | **Parameters** | 360M |
35
+ | **Architecture** | Decoder-only Transformer |
36
+ | **Language** | English / 英語 |
37
+ | **License** | Apache 2.0 |
38
+ | **Training Additions** | Basic math, factual Q&A / 基本数学・事実ベースのデータ追加 |
39
+
40
+ ---
41
+
42
+ ## 🚀 Use Cases | 主な用途
43
+
44
+ - 🤖 Lightweight chatbots / 軽量チャットボット
45
+ - 📱 Inference on CPUs or mobile devices / CPUやモバイル端末での推論
46
+ - 📚 Educational or hobbyist projects / 教育・趣味向けプロジェクト
47
+ - 🧾 Instruction-following tasks / 指示応答タスク
48
+ - ➗ Basic math questions / 基本的な数学問題への対応
49
+
50
+ ---
51
+
52
+ ## 🛠️ How to Use | 使用方法
53
+
54
+ ```python
55
+ from transformers import AutoTokenizer, AutoModelForCausalLM
56
+
57
+ model_name = "summerstars/SolaraV2"
58
+
59
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
60
+ model = AutoModelForCausalLM.from_pretrained(model_name)
61
+
62
+ prompt = "What is 15 * 4?"
63
+ inputs = tokenizer(prompt, return_tensors="pt")
64
+ outputs = model.generate(**inputs, max_new_tokens=64)
65
+
66
+ # Print the result / 結果を表示
67
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))