GENIUS: Generative Fluid Intelligence Evaluation Suite • Paper • 2602.11144 • Published 4 days ago • 52
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark • Paper • 2510.26802 • Published Oct 30, 2025 • 34