ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs Paper • 2510.00857 • Published Oct 1, 2025 • 1
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 22
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens Paper • 2108.11193 • Published Jun 8, 2022
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published about 1 month ago • 46
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published Mar 10, 2026 • 75
Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality Paper • 2602.14080 • Published Feb 15, 2026 • 21
STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts Paper • 2602.14265 • Published Feb 15, 2026 • 21
The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents Paper • 2601.11496 • Published Jan 16, 2026 • 47
DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models Paper • 2510.15015 • Published Oct 16, 2025 • 11
Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs Paper • 2509.22582 • Published Sep 26, 2025 • 12
CRISP: Persistent Concept Unlearning via Sparse Autoencoders Paper • 2508.13650 • Published Aug 19, 2025 • 16
Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs Paper • 2507.07186 • Published Jul 9, 2025 • 3