namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-1.5b-instruct Text Generation • 3B • Updated 3 days ago • 8
namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-1.5b-instruct Text Generation • 3B • Updated 3 days ago • 8
namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-3b-instruct Text Generation • 3B • Updated 3 days ago • 10
namezz/lvm-rel-a-qwen2.5-3b-instruct-b-qwen2.5-3b-instruct Text Generation • 3B • Updated 3 days ago • 10
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces Paper • 2602.11683 • Published 10 days ago • 7
namezz/cold-start-qwen-8b-base-inittag-keepthink-lr1e-5-gpu4-bs2-ga8-ep2-wr0.1-cut12000 308k • Updated Dec 12, 2025