ROOT: Robust Orthogonalized Optimizer for Neural Network Training Paper • 2511.20626 • Published Nov 25, 2025 • 43 • 5
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models Paper • 2403.00818 • Published Feb 26, 2024 • 19 • 2