tencent/Youtu-LLM-2B
Text Generation
•
2B
•
Updated
•
6.61k
•
213
None defined yet.
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
AT$^2$PO: Agentic Turn-based Policy Optimization via Tree Search