3-min overview · captions available · Open on YouTube
Generative navigation policies have made rapid progress in improving end-to-end learned navigation. Despite their promising results, this paradigm has two structural problems. First, the sampled trajectories exist in an abstract, unscaled space without metric grounding. Second, the control strategy discards the full path, instead moving directly towards a single waypoint. This leads to short-sighted and unsafe actions, moving the robot towards obstacles that a complete and correctly scaled path would circumvent. To address these issues, we propose MetricNet, an effective add-on for generative navigation that predicts the metric distance between waypoints, grounding policy outputs in real-world coordinates. We evaluate our method in simulation with a new benchmarking framework and show that executing MetricNet-scaled waypoints significantly improves both navigation and exploration performance. Beyond simulation, we further validate our approach in real-world experiments. Finally, we propose MetricNav, which integrates MetricNet into a navigation policy to guide the robot away from obstacles while still moving towards the goal.
MetricNav is able to do complete 180 degree turns if there are is no free space in front.
MetricNet demonstrates that including depth priors during model training helps the model more representative better action policies for robot navigation.
@misc{nayak2025metricnetrecoveringmetricscale,
title={MetricNet: Recovering Metric Scale in Generative Navigation Policies},
author={Abhijeet Nayak and Débora N. P. Oliveira and Samiran Gode and Cordelia Schmid and Wolfram Burgard},
year={2025},
eprint={2509.13965},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2509.13965},
}