One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling Paper • 2601.03111 • Published 7 days ago • 8
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models Paper • 2407.01046 • Published Jul 1, 2024