# Lightweight plant phenotypic feature extraction via transferable attention head pruning in Vision Transformers

**Authors:** Yongsheng Xie, Xiaoxiao Zeng, Rifeng Wang, Wenxin Li

PMC · DOI: 10.3389/fpls.2026.1747598 · Frontiers in Plant Science · 2026-02-13

## TL;DR

This paper introduces a method to efficiently extract plant features using a streamlined attention mechanism in Vision Transformers, enabling accurate and fast analysis on low-power devices.

## Contribution

The novel TAHA framework combines cross-species transfer learning with dynamic head pruning to reduce computational costs while maintaining accuracy in plant phenotyping.

## Key findings

- TAHA achieves similar accuracy to full models but reduces computational costs by up to 40%.
- The framework allows gradual adaptation to new species without full retraining.
- Phylogenetic distance impacts performance, with a 7.2% F1-score drop for distant species transfers.

## Abstract

We propose a lightweight Multi-Head Self-Attention (MHSA) mechanism for plant phenotypic feature extraction, which integrates cross-species transfer learning with dynamic head pruning to improve efficiency without compromising accuracy. The primary challenge stems from minimizing redundant computations without compromising the model’s capacity to generalize over varied plant species, an issue intensified by the substantial dimensionality of attention mechanisms in Vision Transformers. Our solution, the Transferable Attention Head Alignment (TAHA) framework, operates in three stages: pre-training on a source species, cross-species alignment via a Domain Alignment Loss (DAL), and head pruning based on a transferability score. The framework selects and keeps solely the attention heads with the highest transferability, thus diminishing model intricacy without compromising the ability to distinguish phenotypic traits. Furthermore, the pruned MHSA module is smoothly combined with standard Transformer backbones, which makes efficient deployment on edge devices possible. Experiments were conducted on real edge hardware (Raspberry Pi 4, NVIDIA Jetson Nano) and GPU platforms, showing our approach attains accuracy similar to full-head models yet cuts computational expenses by as much as 40% (14.1 ms inference latency on Raspberry Pi 4, 519 M parameters). The method holds special importance for scalable plant phenotyping, in situations where computational capacity is frequently constrained yet generalization across species is essential. Moreover, the repeated alignment and pruning procedure permits gradual adjustment to novel species without complete retraining, which increases feasibility for agricultural applications in practical settings. Supplementary experiments on phylogenetically distant species (Arabidopsis → pine) demonstrate the framework’s generalization limits, with a 7.2% F1-score drop compared to close-species transfer (Arabidopsis → maize), highlighting the need for trait-specific head adaptation in distant transfers. The proposed method improves lightweight feature extraction by merging transfer learning and attention head optimization, achieving a balanced compromise between performance and efficiency.

## Linked entities

- **Species:** Arabidopsis (taxon 3701)

## Full-text entities

- **Diseases:** plant disease (MESH:D010939), lesion (MESH:D009059)
- **Species:** Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Homo sapiens (human, species) [taxon 9606], Oryza sativa (Asian cultivated rice, species) [taxon 4530]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12946057/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12946057/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/PMC12946057/full.md

---
Source: https://tomesphere.com/paper/PMC12946057