Flash-KMeans: Fast and Memory-Efficient Exact K-Means Paper • 2603.09229 • Published 10 days ago • 79
view article Article Who Routes LLM Routers? RouterArena: Building the Evaluation Foundation for LLM Routing Nov 11, 2025 • 14
ProxSparse: Regularized Learning of Semi-Structured Sparsity Masks for Pretrained LLMs Paper • 2502.00258 • Published Feb 1, 2025 • 1
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Paper • 2503.16419 • Published Mar 20, 2025 • 77