mradermacher/Qwen3-14B-ARPO-DeepSearch-GGUF Reinforcement Learning • 15B • Updated Aug 12, 2025 • 43 • 2
mradermacher/Qwen3-14B-ARPO-DeepSearch-i1-GGUF Reinforcement Learning • 15B • Updated 13 days ago • 1.18k • 1