What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published 16 days ago • 8
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 275
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 166
SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance Paper • 2412.02687 • Published Dec 3, 2024 • 113
SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher Paper • 2408.14176 • Published Aug 26, 2024 • 62