LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 6 days ago • 127
Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 10 items • Updated 1 day ago • 92
Sentence Transformers v5.4 integrations Collection See https://huggingface.co/blog/multimodal-sentence-transformers • 34 items • Updated 24 days ago • 5
🍎 Qwopus3.6 Collection This collection features the advanced Qwopus3.6 series of multimodal large models, which are fine-tuned from the Qwen3.6 base models with a focus on e • 10 items • Updated 8 days ago • 58
📙 LLM Engineer's Handbook Collection Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook • 6 items • Updated Apr 7, 2025 • 16
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 5 items • Updated Jan 27 • 174
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 12, 2025 • 86
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published May 8, 2025 • 187
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 191
Article Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) +1 elisim, kashif, nielsr • Jun 16, 2023 • 45
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published May 1, 2025 • 36
Article What is MoE 2.0? Update Your Knowledge about Mixture-of-experts Kseniase • Apr 27, 2025 • 10
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning Paper • 2505.02835 • Published May 5, 2025 • 28
Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? Kseniase • Mar 17, 2025 • 357
Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial open-r1 • Jan 31, 2025 • 51