小明

xiaoming

xiaominghero

AI & ML interests

nlp

Recent Activity

liked a dataset about 1 month ago

stepfun-ai/Step-3.5-Flash-SFT

liked a dataset about 1 month ago

nvidia/Nemotron-Pretraining-Code-v1

upvoted an article about 1 month ago

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

Organizations

None yet

liked 2 datasets about 1 month ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated Mar 14 • 1.62M • 36.9k • 321

nvidia/Nemotron-Pretraining-Code-v1

Viewer • Updated Dec 23, 2025 • 936M • 11.2k • 65

upvoted an article about 1 month ago

Article

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

Mar 11

liked 2 models about 2 months ago

stepfun-ai/Step-3.5-Flash-Base-Midtrain

Text Generation • 198B • Updated Mar 9 • 174 • 40

stepfun-ai/Step-3.5-Flash-Base

Text Generation • 198B • Updated Mar 9 • 252 • 83

upvoted a paper 2 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 195

upvoted a collection 2 months ago

UltraData

Collection

Ultra Scale, Ultra Quality, Ultra Coverage • 10 items • Updated 3 days ago • 81

liked 3 models 3 months ago

upvoted 2 papers 3 months ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published Jan 9 • 86

upvoted an article 3 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

upvoted 2 papers 4 months ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 87

Step-GUI Technical Report

Paper • 2512.15431 • Published Dec 17, 2025 • 133

upvoted an article 4 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025 • 621

liked a Space 6 months ago

The Smol Training Playbook

📚

3.11k

The secrets to building world-class LLMs

liked a dataset 6 months ago

allenai/CoSyn-400K

Viewer • Updated Feb 28, 2025 • 408k • 1.83k • 48

upvoted a collection 7 months ago

MobileLLM-R1

Collection

MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 30

liked a dataset 7 months ago

allenai/WildChat-4.8M

Viewer • Updated Aug 11, 2025 • 3.2M • 7.36k • 141
