Pretrained LLMs from scratch.
Y. Yu
PursuitOfDataScience
AI & ML interests
LLM, GPU Computing, PyTorch
Recent Activity
updated a dataset 7 days ago
PursuitOfDataScience/openmath-reasoning-medley
updated a collection 8 days ago
ArgonneAI
updated a model 8 days ago
PursuitOfDataScience/Argonne-2.5-ctx13568-instruct
Organizations
None yet
Sandbox Models
Trial & Error models for various tasks.
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated • 2
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated • 1
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated • 6
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated • 2
ArgonneAI
Pretrained LLMs from scratch.
models 31
PursuitOfDataScience/Argonne-2.5-ctx13568-instruct
Text Generation • 1B • Updated • 211
PursuitOfDataScience/Argonne-2.5-ctx13568
Text Generation • Updated • 535
PursuitOfDataScience/Argonne2.5-instruct
Text Generation • 1B • Updated • 974
PursuitOfDataScience/Argonne2.5-base
Text Generation • 1B • Updated • 729
PursuitOfDataScience/Qwen3.5-0.8B-Opus-4.6-thinking
Text Generation • 0.8B • Updated • 11 • 2
PursuitOfDataScience/Qwen3.5-0.8B-thinking
Text Generation • 0.8B • Updated • 11
PursuitOfDataScience/llama3.2-3b-thinking
Updated • 3
PursuitOfDataScience/Qwen3-0.6b-thinking
Text Generation • Updated • 3
PursuitOfDataScience/Llama-3.2-1B-GRPO
Text Generation • 1B • Updated • 5
PursuitOfDataScience/Argonne-2.0
Text Generation • 6B • Updated • 4
datasets 47
PursuitOfDataScience/openmath-reasoning-medley
Viewer • Updated • 3.66M • 677
PursuitOfDataScience/dream-of-the-red-chamber-continuations
Viewer • Updated • 92 • 67 • 1
PursuitOfDataScience/oss-code-seeds
Viewer • Updated • 314k • 16
PursuitOfDataScience/toucan-agentic-thinking
Viewer • Updated • 119k • 12
PursuitOfDataScience/arxiv-qa-thinking
Viewer • Updated • 215k • 39
PursuitOfDataScience/0.9M-thinking
Viewer • Updated • 898k • 24
PursuitOfDataScience/0.5M-thinking
Viewer • Updated • 499k • 30
PursuitOfDataScience/MiniMax-M2.1-Mixture-of-Thoughts
Viewer • Updated • 349k • 282 • 2
PursuitOfDataScience/gsm8k-thinking
Viewer • Updated • 8.79k • 67
PursuitOfDataScience/bbc-news-llama4-maverick-summary
Viewer • Updated • 174k • 8