Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers Paper • 2601.17367 • Published 9 days ago • 33
MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models Paper • 2601.11969 • Published 16 days ago • 26
meta-llama/Llama-3.1-8B-Instruct Text Generation • 8B • Updated Sep 25, 2024 • 9.95M • • 5.36k
LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling Paper • 2510.06915 • Published Oct 8, 2025 • 16
Revisiting Long-context Modeling from Context Denoising Perspective Paper • 2510.05862 • Published Oct 7, 2025 • 21