4 11

ruins

ruinnight

AI & ML interests

None yet

Recent Activity

liked a Space 5 days ago

lm-provers/qed-nano-blogpost

upvoted an article about 2 months ago

Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads

upvoted an article about 2 months ago

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

View all activity

Organizations

None yet

liked a Space 5 days ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

upvoted 2 articles about 2 months ago

Article

Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads

Jan 6

•

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Jan 5

•

upvoted 2 articles 4 months ago

Article

History of State Space Models (SSM) in 2022

Apr 11, 2024

•

Article

Introduction to State Space Models (SSM)

Jul 19, 2024

•

212

liked 3 Spaces 4 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Visualize on-policy distillation for any model family

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

The Smol Training Playbook

📚

3.02k

The secrets to building world-class LLMs

liked 3 models 6 months ago

liked a dataset 6 months ago

GPUMODE/KernelBook

Viewer • Updated 22 days ago • 18.2k • 573 • 46

liked a dataset 11 months ago

OpenDILabCommunity/MasterMind

Viewer • Updated Mar 20, 2025 • 696k • 541 • 6

liked a Space 12 months ago

Number Tokenization Blog

📈

113

Explore how tokenization affects arithmetic in LLMs

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.71k

The ultimate guide to training LLM on large GPU Clusters

ruins

AI & ML interests

Recent Activity

Organizations

ruinnight's activity

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

History of State Space Models (SSM) in 2022

Introduction to State Space Models (SSM)

Unlocking On-Policy Distillation for Any Model Family

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

The Smol Training Playbook

Number Tokenization Blog

The Ultra-Scale Playbook