Generating Landmark Navigation Instructions from Maps as a Graph-to-Text   Problem

Raphael Schumann; Stefan Riezler

arXiv:2012.15329·cs.CL·May 27, 2021

Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem

Raphael Schumann, Stefan Riezler

PDF

TL;DR

This paper introduces a neural model that converts map graphs into human-like landmark-based navigation instructions, improving natural language guidance for navigation tasks.

Contribution

It presents a novel graph-to-text model trained on a new dataset, generating landmark-based instructions from map representations for navigation.

Findings

01

Generated instructions are similar to human instructions.

02

Instructions successfully guide humans in Street View navigation.

03

Model achieves effective landmark-based navigation guidance.

Abstract

Car-focused navigation services are based on turns and distances of named streets, whereas navigation instructions naturally used by humans are centered around physical objects called landmarks. We present a neural model that takes OpenStreetMap representations as input and learns to generate navigation instructions that contain visible and salient landmarks from human natural language instructions. Routes on the map are encoded in a location- and rotation-invariant graph representation that is decoded into natural language instructions. Our work is based on a novel dataset of 7,672 crowd-sourced instances that have been verified by human navigation in Street View. Our evaluation shows that the navigation instructions generated by our system have similar properties as human-generated instructions, and lead to successful human navigation in Street View.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.