FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models Paper • 2508.01506 • Published Aug 2 • 1
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems Paper • 2510.12872 • Published Oct 14 • 2
HippoMM: Hippocampal-inspired Multimodal Memory for Long Audiovisual Event Understanding Paper • 2504.10739 • Published Apr 14 • 2
IoT-MCP: Bridging LLMs and IoT Systems Through Model Context Protocol Paper • 2510.01260 • Published Sep 25 • 2
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play Paper • 2509.25541 • Published Sep 29 • 140
CoreMatching: A Co-adaptive Sparse Inference Framework with Token and Neuron Pruning for Comprehensive Acceleration of Vision-Language Models Paper • 2505.19235 • Published May 25 • 3
Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals Paper • 2506.02281 • Published Jun 2 • 4
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs Paper • 2505.21327 • Published May 27 • 83
Performance-aware Approximation of Global Channel Pruning for Multitask CNNs Paper • 2303.11923 • Published Mar 21, 2023 • 1
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy Paper • 2410.09873 • Published Oct 13, 2024 • 3
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training Paper • 2412.11863 • Published Dec 16, 2024 • 4
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models Paper • 2406.11633 • Published Jun 17, 2024 • 1
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression Paper • 2403.15835 • Published Mar 23, 2024