dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

Experiential Reinforcement Learning

upvoted a paper 1 day ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

upvoted a paper 1 day ago

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

View all activity

Organizations

None yet

upvoted a paper about 16 hours ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 3 days ago • 38

upvoted 2 papers 1 day ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published 6 days ago • 53

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 7 days ago • 208

upvoted an article 5 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

5 days ago

•

123

upvoted 3 papers 8 days ago

upvoted a paper 9 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 12 days ago • 70

upvoted 3 papers 10 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 14 days ago • 93

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published 13 days ago • 28

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published 13 days ago • 26

upvoted a paper 13 days ago

No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data

Paper • 2602.04442 • Published 14 days ago • 3

upvoted an article 15 days ago

Article

🐯 Liger GRPO meets TRL

May 25, 2025

•

upvoted 3 papers 15 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 16 days ago • 233

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published 17 days ago • 280

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published 16 days ago • 60

upvoted a paper 18 days ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published 25 days ago • 40

upvoted an article 19 days ago

Article

Small Language Models (SLM): A Comprehensive Overview

Feb 22, 2025

•

131

upvoted an article 22 days ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.07k

upvoted an article about 1 month ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Jan 2

•

dfuhoiysOHSVFh82934gfjklb

AI & ML interests

Recent Activity

Organizations

huba-buba's activity

Forge: Scalable Agent RL Framework and Algorithm

🐯 Liger GRPO meets TRL

Small Language Models (SLM): A Comprehensive Overview

Mixture of Experts Explained

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU