- An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
  Paper • 2403.06764 • Published • 27
- VideoMamba: State Space Model for Efficient Video Understanding
  Paper • 2403.06977 • Published • 29
- Phased Consistency Model
  Paper • 2405.18407 • Published • 48
Natalia Frumkin
nfrumkin
AI & ML interests
None yet
Recent Activity
upvoted a paper about 21 hours ago
Training-free Latent Inter-Frame Pruning with Attention Recovery
upvoted a paper 6 months ago
Jumping through Local Minima: Quantization in the Loss Landscape of Vision Transformers