NaVILA: Legged Robot Vision-Language-Action Model for Navigation • Paper • 2412.04453 • Published Dec 5, 2024
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models • Paper • 2406.01584 • Published Jun 3, 2024