GTPBD-MM: A Global Terraced Parcel and Boundary Dataset with Multi-Modality

Zhiwei Zhang; Xingyuan Zeng; Xinkai Kong; Kunquan Zhang; Haoyuan Liang; Bohan Shi; Juepeng Zheng; Jianxi Huang; Yutong Lu; Haohuan Fu

arXiv:2604.12315·cs.CV·April 15, 2026

TL;DR

This paper introduces GTPBD-MM, a comprehensive multimodal benchmark dataset for extracting terraced parcels in mountainous regions, integrating optical imagery, text descriptions, and DEM data to improve parcel delineation accuracy.

Contribution

The paper presents the first unified benchmark for complex terraced parcel extraction using aligned image-text-DEM data, and proposes a multimodal baseline network, ETTerra.

Findings

01. Textual semantics and terrain geometry improve delineation accuracy.
02. Multimodal cues lead to more coherent parcel boundaries.
03. Experiments show significant performance gains over visual-only methods.

Abstract

Agricultural parcel extraction plays an important role in remote sensing-based agricultural monitoring, supporting parcel surveying, precision management, and ecological assessment. However, existing public benchmarks mainly focus on regular and relatively flat farmland scenes. In contrast, terraced parcels in mountainous regions exhibit stepped terrain, pronounced elevation variation, irregular boundaries, and strong cross-regional heterogeneity, making parcel extraction a more challenging problem that jointly requires visual recognition, semantic discrimination, and terrain-aware geometric understanding. Although recent studies have advanced visual parcel benchmarks and image-text farmland understanding, a unified benchmark for complex terraced parcel extraction under aligned image-text-DEM settings remains absent. To fill this gap, we present GTPBD-MM, the first multimodal benchmark…
