Devin Thang

winvswon78

devininthelab

AI & ML interests

None yet

Recent Activity

new activity 23 days ago

stabilityai/stable-video-diffusion-img2vid-xt: Can SVD be used with DDPMScheduler?

upvoted a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

commented on an article about 2 months ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Organizations

upvoted a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92

upvoted 2 articles 4 months ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Article • Feb 11, 2025

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7, 2025 • 268

upvoted a paper 5 months ago

Reconstructing 4D Spatial Intelligence: A Survey

Paper • 2507.21045 • Published Jul 28, 2025 • 36

upvoted a paper 7 months ago

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published Jun 5, 2025 • 27

upvoted 2 articles 7 months ago

KV Cache from scratch in nanoVLM

Article • Jun 4, 2025 • 108

Vision Language Models (Better, faster, stronger)

Article • May 12, 2025 • 580

upvoted a paper 8 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20, 2025 • 133

upvoted an article 8 months ago

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Article • May 21, 2025 • 247

upvoted a collection 8 months ago

Aero-1-Audio

Collection

2 items • Updated May 1, 2025 • 1

upvoted a collection 9 months ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 6 days ago • 160

upvoted a paper 11 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10, 2025 • 32

upvoted a collection 11 months ago

Ola

Collection

Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment • 4 items • Updated Feb 21, 2025 • 3

upvoted a paper 11 months ago

Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 43

upvoted 2 articles 11 months ago

Mixture of Experts Explained

Article • Dec 11, 2023 • 1.02k

How NuminaMath Won the 1st AIMO Progress Prize

Article • Jul 11, 2024 • 124

upvoted a paper 12 months ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23, 2025 • 23

upvoted a paper about 1 year ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 19
