MetricNet: Recovering Metric Scale in Generative Navigation Policies

1 University of Technology Nuremberg
2 Inria, Ecole Normale Superieure, CNRS, PSL Research University
* Authors contributed equally

MetricNet generates waypoints in real-world coordinates; MetricNav uses these to guide the robot away from obstacles and towards the goal.


Abstract

Generative navigation policies have made rapid progress in improving end-to-end learned navigation. Despite their promising results, this paradigm has two structural problems. First, the sampled trajectories exist in an abstract, unscaled space without metric grounding. Second, the control strategy discards the full path, instead moving directly towards a single waypoint. This leads to short-sighted and unsafe actions, moving the robot towards obstacles that a complete and correctly scaled path would circumvent. To address these issues, we propose MetricNet, an effective add-on for generative navigation that predicts the metric distance between waypoints, grounding policy outputs in real-world coordinates. We evaluate our method in simulation with a new benchmarking framework and show that executing MetricNet-scaled waypoints significantly improves both navigation and exploration performance. Beyond simulation, we further validate our approach in real-world experiments. Finally, we propose MetricNav, which integrates MetricNet into a navigation policy to guide the robot away from obstacles while still moving towards the goal.
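To make the idea of metric grounding concrete, here is a minimal sketch of how predicted inter-waypoint distances could be used to rescale an unscaled trajectory. It is an illustration only, not the authors' implementation: the function name `scale_waypoints`, the 2D robot-frame convention with the robot at the origin, and the assumption that one distance (in meters) is predicted per path segment are all hypothetical choices made for this example.

```python
import math


def scale_waypoints(unscaled, distances, eps=1e-9):
    """Rescale each path segment to a predicted metric length.

    Hypothetical helper, not part of the MetricNet code base.
    unscaled:  list of (x, y) waypoints in an arbitrary-scale robot frame,
               with the robot assumed to sit at the origin.
    distances: assumed per-segment metric lengths in meters, one per waypoint
               (origin -> wp0, wp0 -> wp1, ...).
    Returns a list of (x, y) waypoints in meters.
    """
    if len(unscaled) != len(distances):
        raise ValueError("need exactly one distance per waypoint")

    metric = []
    prev_u = (0.0, 0.0)  # previous point in the unscaled frame
    prev_m = (0.0, 0.0)  # previous point in the metric frame
    for (ux, uy), d in zip(unscaled, distances):
        dx, dy = ux - prev_u[0], uy - prev_u[1]
        norm = math.hypot(dx, dy)
        if norm < eps:
            # Degenerate segment: no direction to follow, so stay in place.
            step = (0.0, 0.0)
        else:
            # Keep the segment direction, replace its length with d.
            step = (dx / norm * d, dy / norm * d)
        prev_m = (prev_m[0] + step[0], prev_m[1] + step[1])
        metric.append(prev_m)
        prev_u = (ux, uy)
    return metric


# Example: two unit-length unscaled segments mapped to 0.5 m and 0.25 m.
path = scale_waypoints([(1.0, 0.0), (1.0, 1.0)], [0.5, 0.25])
```

Rescaling segment by segment preserves the shape of the sampled path (its turning directions) while fixing its size, which is the property a downstream controller needs in order to reason about obstacles in real-world units.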

Avoiding obstacles with MetricNav

MetricNav can perform a complete 180-degree turn when there is no free space in front of the robot.

Velocity control vs. following MetricNet waypoints as is

Metric waypoints in real-world coordinates

MetricNet demonstrates that including depth priors during training helps the model learn more representative and better action policies for robot navigation.

BibTeX

@misc{nayak2025metricnetrecoveringmetricscale,
      title={MetricNet: Recovering Metric Scale in Generative Navigation Policies},
      author={Abhijeet Nayak and Débora N. P. Oliveira and Samiran Gode and Cordelia Schmid and Wolfram Burgard},
      year={2025},
      eprint={2509.13965},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2509.13965},
}