What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published Oct 31, 2024 • 64
Statler: State-Maintaining Language Models for Embodied Reasoning Paper • 2306.17840 • Published Jun 30, 2023 • 12