nicolas's picture

48 54

nicolas

niko91i

·

AI & ML interests

None yet

Recent Activity

liked a model about 16 hours ago

zai-org/GLM-ASR-Nano-2512

upvoted a paper about 16 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

liked a model 1 day ago

zai-org/GLM-4.6V

View all activity

Organizations

None yet

upvoted a paper about 16 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 2 days ago • 59

upvoted a paper 14 days ago

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published 23 days ago • 118

upvoted a paper 27 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 28 days ago • 112

upvoted a paper 28 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9 • 129

upvoted a paper about 1 month ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31 • 70

upvoted 4 papers about 2 months ago

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

Paper • 2510.13999 • Published Oct 15 • 5

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26 • 17

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published Oct 6 • 113

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 497

upvoted 2 papers 3 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 116

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

upvoted 6 papers 4 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19 • 48

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 130

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published Jul 30 • 99

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 135

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 56

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158

upvoted 3 papers 5 months ago

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published Jul 21 • 67

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 134

KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11 • 40