D4: Text-guided diffusion model-based domain adaptive data augmentation   for vineyard shoot detection

Kentaro Hirahara; Chikahito Nakane; Hajime Ebisawa; Tsuyoshi Kuroda,; Yohei Iwaki; Tomoyoshi Utsumi; Yuichiro Nomura; Makoto Koike; Hiroshi Mineno

arXiv:2409.04060·cs.CV·September 9, 2024

D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection

Kentaro Hirahara, Chikahito Nakane, Hajime Ebisawa, Tsuyoshi Kuroda,, Yohei Iwaki, Tomoyoshi Utsumi, Yuichiro Nomura, Makoto Koike, Hiroshi Mineno

PDF

Open Access

TL;DR

This paper introduces D4, a text-guided diffusion model-based data augmentation technique that enhances vineyard shoot detection by generating diverse, annotated images, significantly improving detection accuracy and addressing data scarcity in agricultural applications.

Contribution

The study presents a novel generative data augmentation method using a text-guided diffusion model tailored for vineyard shoot detection, overcoming annotation challenges and domain diversity issues.

Findings

01

Improved mean average precision by up to 28.65% for bounding box detection.

02

Enhanced average precision by up to 13.73% for keypoint detection.

03

Effectively generated diverse annotated images for agricultural domain adaptation.

Abstract

In an agricultural field, plant phenotyping using object detection models is gaining attention. However, collecting the training data necessary to create generic and high-precision models is extremely challenging due to the difficulty of annotation and the diversity of domains. Furthermore, it is difficult to transfer training data across different crops, and although machine learning models effective for specific environments, conditions, or crops have been developed, they cannot be widely applied in actual fields. In this study, we propose a generative data augmentation method (D4) for vineyard shoot detection. D4 uses a pre-trained text-guided diffusion model based on a large number of original images culled from video data collected by unmanned ground vehicles or other means, and a small number of annotated datasets. The proposed method generates new annotated images with background…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Agriculture and AI

MethodsDiffusion