Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning • Paper • 2601.15160 • Published 28 days ago