# Improving computer vision for plant pathology through advanced training techniques

**Authors:** Jamie R. Sykes, Katherine J. Denby, Daniel W. Franks

PMC · DOI: 10.1002/aps3.70010 · Applications in Plant Sciences · 2025-06-07

## TL;DR

This paper improves computer vision for cocoa disease detection using advanced training techniques like semi-supervised learning and a new loss function.

## Contribution

The study introduces dynamic focal loss and a new benchmark dataset for cocoa disease detection.

## Key findings

- Semi-supervised learning improved model generalization and reduced overfitting.
- Dynamic focal loss enhanced performance on difficult cases.
- A new benchmark dataset of 7220 cocoa images was created for better evaluation.

## Abstract

This study investigates advanced training techniques to improve the performance of convolutional neural networks for disease detection in cocoa, Theobroma cacao.

Despite recent stagnation in accuracy improvements in computer vision for image classification, our research demonstrates significant advancements in performance through semi‐supervised learning, specialised loss functions, and the inclusion of a non‐cocoa class.

Semi‐supervised learning reduced overfitting and enhanced generalisability, particularly for subtle symptoms. The non‐cocoa class exposed models to a broad range of relevant features, significantly improving model robustness and performance in difficult cases. Grad‐CAM for qualitative assessment provided valuable insights into model behaviour, highlighting cases of overfitting missed by summary statistics. We also describe dynamic focal loss, a novel loss function that uses an empirical measure of difficulty to weight each image. Our results suggest that while PhytNet shows promise in terms of computational efficiency and superior handling of difficult images, ResNet18 with semi‐supervised learning and dynamic focal loss emerged as the strongest contender for real‐world deployment.

This research underscores the potential of semi‐supervised learning and advanced loss functions in enhancing the applicability of deep learning models in agricultural disease management. It also presents a new high‐quality benchmark dataset of 7220 images of diseased and healthy cocoa trees, offering a much greater and more realistic challenge than the Plan Village dataset.

## Linked entities

- **Species:** Theobroma cacao (taxon 3641)

## Full-text entities

- **Species:** Theobroma cacao (cacao, species) [taxon 3641]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12188622/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12188622/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC12188622/full.md

---
Source: https://tomesphere.com/paper/PMC12188622