OsmT: Bridging OpenStreetMap Queries and Natural Language with Open-source Tag-aware Language Models
Zhuoyue Wan, Wentao Hu, Chen Jason Zhang, Yuanfeng Song, Shuaimin Li, Ruiqiang Xiao, Xiao-Yong Wei, and Raymond Chi-Wing Wong

TL;DR
OsmT is an open-source, tag-aware language model designed to convert natural language into Overpass Query Language for OpenStreetMap data, improving accuracy and interpretability while being lightweight and adaptable.
Contribution
The paper introduces OsmT, a novel open-source language model with a Tag Retrieval Augmentation mechanism tailored for geospatial query generation and interpretation.
Findings
OsmT outperforms strong baselines in query accuracy.
It achieves competitive results with fewer parameters.
The model enhances query interpretability through reverse translation.
Abstract
Bridging natural language and structured query languages is a long-standing challenge in the database community. While recent advances in language models have shown promise in this direction, existing solutions often rely on large-scale closed-source models that suffer from high inference costs, limited transparency, and lack of adaptability for lightweight deployment. In this paper, we present OsmT, an open-source tag-aware language model specifically designed to bridge natural language and Overpass Query Language (OverpassQL), a structured query language for accessing large-scale OpenStreetMap (OSM) data. To enhance the accuracy and structural validity of generated queries, we introduce a Tag Retrieval Augmentation (TRA) mechanism that incorporates contextually relevant tag knowledge into the generation process. This mechanism is designed to capture the hierarchical and relational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies · Data Management and Algorithms · Constraint Satisfaction and Optimization
