CalCROP21: A Georeferenced multi-spectral dataset of Satellite Imagery and Crop Labels
Rahul Ghosh, Praveen Ravirathinam, Xiaowei Jia, Ankush Khandelwal,, David Mulla, Vipin Kumar

TL;DR
CalCROP21 is a new high-resolution, georeferenced multi-spectral dataset for crop mapping in California, enhanced by a novel attention-based segmentation algorithm that outperforms existing USDA labels.
Contribution
The paper introduces CalCROP21, a detailed crop dataset at 10m resolution, and proposes STATT, an attention-based model that improves crop classification accuracy over traditional USDA labels.
Findings
STATT outperforms resampled CDL labels in accuracy.
CalCROP21 provides a comprehensive, high-resolution crop dataset.
The dataset and pipeline are publicly available.
Abstract
Mapping and monitoring crops is a key step towards sustainable intensification of agriculture and addressing global food security. A dataset like ImageNet that revolutionized computer vision applications can accelerate development of novel crop mapping techniques. Currently, the United States Department of Agriculture (USDA) annually releases the Cropland Data Layer (CDL) which contains crop labels at 30m resolution for the entire United States of America. While CDL is state of the art and is widely used for a number of agricultural applications, it has a number of limitations (e.g., pixelated errors, labels carried over from previous errors and absence of input imagery along with class labels). In this work, we create a new semantic segmentation benchmark dataset, which we call CalCROP21, for the diverse crops in the Central Valley region of California at 10m spatial resolution using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
