# Crop classification method for multi-temporal remote sensing imagery based on a (3 + 2)D SAFPN

**Authors:** Yicong Sun, Tingting Zhao, Yue Zhang, Xia Yu, Liqian Zhang, Yunli Bai

PMC · DOI: 10.3389/fpls.2026.1765836 · Frontiers in Plant Science · 2026-02-10

## TL;DR

This paper introduces a new neural network model for crop classification using multi-temporal satellite data, achieving high accuracy and strong generalization.

## Contribution

The novel (3+2)D SAFPN model integrates spatiotemporal and multi-scale features with a split-attention mechanism for improved crop classification.

## Key findings

- The (3+2)D SAFPN achieved 89.01% accuracy on the test set and 89.06% on the validation set.
- The model outperformed the baseline by 2.88% on the Munich dataset test set.
- The approach effectively combines spatial, spectral, and temporal information for crop mapping.

## Abstract

Accurate crop classification plays a critical role in agricultural monitoring and food security assurance. Effectively exploiting spatiotemporal information from multi-temporal remote sensing data remains a key challenge in crop mapping.This study proposes an improved neural network model, termed the (3+2)D Split-Attention Feature Pyramid Network ((3+2)D SAFPN), which is built upon a hybrid 3D–2D Feature Pyramid Network ((3+2)D FPN). The model integrates a 3D FPN to capture spatiotemporal crop dynamics, a 2D FPN to extract multi-scale spatial features, a split-attention (SA) mechanism to enhance inter-channel information interaction, and a focal loss function to improve learning performance on minority crop classes. Multi-temporal Sentinel-2 imagery acquired in 2024 was used to construct a plot-level NDVI time-series dataset for Talhu Town, Wuyuan County, Bayannur City, Inner Mongolia. The dataset was divided into training, validation, and test sets with a ratio of 6:2:2.Experimental results demonstrate that the proposed (3+2)D SAFPN model achieved overall accuracies of 89.01% and 89.06% on the test and validation sets, respectively, with Kappa coefficients of 0.82 for both sets, outperforming the original (3+2)D FPN model. Furthermore, comparative experiments conducted on the public Munich dataset indicate strong generalization ability, with accuracy improvements of 2.88% on the test set and 2.44% on the validation set compared to the baseline model.The results indicate that the (3+2)D SAFPN model effectively integrates spatial, spectral, and temporal information from multi-temporal remote sensing imagery, providing a robust and high-accuracy solution for crop classification tasks. This approach shows strong potential for large-scale agricultural monitoring applications. The source code of the proposed model is publicly available at: https://gitee.com/btgw/YicongSun/ree/(3+2)D-SAFPN_torch.

## Full-text entities

- **Diseases:** CE (MESH:C537866), NDVI (MESH:D018458)
- **Chemicals:** chlorophyll (MESH:D002734)
- **Species:** Solanum tuberosum (potatoes, species) [taxon 4113], Lathyrus oleraceus (garden pea, species) [taxon 3888], Cucurbita melopepo (species) [taxon 3665], Beta vulgaris subsp. vulgaris (field beet, subspecies) [taxon 3555], Solanum lycopersicum (tomato, species) [taxon 4081], Cucumis melo var. inodorus (casaba melon, varietas) [taxon 357961], Helianthus (sunflowers, genus) [taxon 4231], Helianthus annuus (common sunflower, species) [taxon 4232]
- **Mutations:** V100S

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12929445/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12929445/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12929445/full.md

---
Source: https://tomesphere.com/paper/PMC12929445