SWE-chat: Coding Agent Interactions From Real Users in the Wild Paper • 2604.20779 • Published 2 days ago • 9
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • Running • 3.8k
AMDGPU onnx Collection optimized image generation ONNX models for AMD Ryzen (TM) AI GPUs and Radeon Discrete GPUs • 19 items • Updated Mar 2 • 13
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training Paper • 2505.11594 • Published May 16, 2025 • 75
Elucidating the Design Space of Diffusion-Based Generative Models Paper • 2206.00364 • Published Jun 1, 2022 • 18
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated Mar 12 • 152