MapTrace: Scalable Data Generation for Route Tracing on Maps
Artemis Panagopoulou, Aveek Purohit, Achin Kulshrestha, Soroosh Yazdani, Mohit Goyal

TL;DR
MapTrace introduces a scalable synthetic data pipeline for route tracing on maps, enabling large-scale fine-tuning of multimodal models to improve their spatial reasoning abilities in map navigation tasks.
Contribution
The paper presents a novel synthetic data generation method for pixel-accurate route annotations, facilitating effective fine-tuning of multimodal models for spatial reasoning.
Findings
Fine-tuning with MapTrace data improves success rates by up to 6.4 points.
Synthetic supervision significantly enhances model robustness in route tracing.
The pipeline enables large-scale, pixel-level annotation for spatial tasks.
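As a rough illustration of what pixel-level route annotation could involve, the sketch below runs a breadth-first search over a binary traversability mask to recover a path between two pixels. This is not the authors' pipeline, which this summary does not detail: the mask representation, the 4-connected neighborhood, and the `trace_route` name are all assumptions made for the example.

```python
from collections import deque


def trace_route(mask, start, goal):
    """Hypothetical helper: shortest 4-connected path of (row, col) pixels.

    `mask` is a 2D list where truthy cells are traversable (an assumed
    representation, not one taken from the paper). Returns None if either
    endpoint is blocked or the goal is unreachable.
    """
    rows, cols = len(mask), len(mask[0])
    if not (mask[start[0]][start[1]] and mask[goal[0]][goal[1]]):
        return None

    parent = {start: None}  # also serves as the visited set
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            # Walk parent links back to the start, then reverse.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and mask[nr][nc] and (nr, nc) not in parent:
                parent[(nr, nc)] = cell
                queue.append((nr, nc))
    return None


# Toy 3x3 "map": 1 = walkable pixel, 0 = obstacle.
toy_mask = [
    [1, 1, 0],
    [0, 1, 0],
    [0, 1, 1],
]
route = trace_route(toy_mask, (0, 0), (2, 2))
```

Because the path is computed from the mask itself, every returned pixel lies on a traversable cell, which is the kind of path constraint the annotations are meant to respect.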
Abstract
While Multimodal Large Language Models have achieved human-like performance on many visual and textual reasoning tasks, their proficiency in fine-grained spatial understanding, such as route tracing on maps, remains limited. Unlike humans, who can quickly learn to parse and navigate maps, current models often fail to respect fundamental path constraints, in part due to the prohibitive cost and difficulty of collecting large-scale, pixel-accurate path annotations. To address this, we introduce a scalable synthetic data generation pipeline that leverages synthetic map images and pixel-level parsing to automatically produce precise annotations for this challenging task. Using this pipeline, we construct a fine-tuning dataset of 23k path samples across 4k maps, enabling models to acquire more human-like spatial capabilities. Using this dataset, we fine-tune both open-source and proprietary…
Taxonomy
Topics: Spatial Cognition and Navigation · Multimodal Machine Learning Applications · Geographic Information Systems Studies
