BuiDoan's picture

BuiDoan

BuiDoan

·

AI & ML interests

None yet

Recent Activity

liked a dataset about 5 hours ago

openbmb/Ultra-FineWeb-L3

liked a model about 5 hours ago

nvidia/LocateAnything-3B

liked a model about 5 hours ago

openbmb/MiniCPM5-1B

View all activity

Organizations

upvoted a paper 4 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 6 days ago • 127

upvoted 2 changelogs 11 days ago

Hugging Face Changelog

Spaces agents.md for your coding agents

Apr 17

• 327

Hugging Face Changelog

Filter Leaderboards by Model Size

11 days ago

• 105

upvoted 2 collections 24 days ago

Nemotron RAG

Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 10 items • Updated 1 day ago • 92

Sentence Transformers v5.4 integrations

See https://huggingface.co/blog/multimodal-sentence-transformers • 34 items • Updated 24 days ago • 5

upvoted a collection 25 days ago

🍎 Qwopus3.6

This collection features the advanced Qwopus3.6 series of multimodal large models, which are fine-tuned from the Qwen3.6 base models with a focus on e • 10 items • Updated 8 days ago • 58

upvoted a paper about 1 month ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 82

upvoted a collection 6 months ago

📙 LLM Engineer's Handbook

Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook • 6 items • Updated Apr 7, 2025 • 16

upvoted a collection 11 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated Jan 27 • 174

upvoted an article about 1 year ago

Article

The 4 Things Qwen-3’s Chat Template Teaches Us

cfahlgren1

•

Apr 30, 2025

• 88

upvoted 3 papers about 1 year ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 86

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8, 2025 • 187

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 191

upvoted an article about 1 year ago

Article

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

+1

elisim, kashif, nielsr

•

Jun 16, 2023

• 45

upvoted 2 papers about 1 year ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1, 2025 • 36

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29, 2025 • 54

upvoted an article about 1 year ago

Article

What is MoE 2.0? Update Your Knowledge about Mixture-of-experts

Kseniase

•

Apr 27, 2025

• 10

upvoted a paper about 1 year ago

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Paper • 2505.02835 • Published May 5, 2025 • 28

upvoted 2 articles about 1 year ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Kseniase

•

Mar 17, 2025

• 357

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

open-r1

•

Jan 31, 2025

• 51