Generating Synthetic Wildlife Health Data from Camera Trap Imagery: A Pipeline for Alopecia and Body Condition Training Data
David Brundage

TL;DR
This paper introduces a pipeline to generate synthetic wildlife health images from camera trap data, enabling improved training for automated health screening of species like North American wildlife.
Contribution
The authors develop a novel pipeline that creates high-quality synthetic images depicting health deterioration, facilitating machine learning models for wildlife health assessment.
Findings
Generated 553 quality-controlled synthetic images from 201 base images.
Achieved 0.85 AUROC in screening health conditions using only synthetic training data.
Pipeline effectively captures visual features relevant for health condition classification.
Abstract
No publicly available, ML ready datasets exist for wildlife health conditions in camera trap imagery, creating a fundamental barrier to automated health screening. We present a pipeline for generating synthetic training images depicting alopecia and body condition deterioration in wildlife from real camera trap photographs. Our pipeline constructs a curated base image set from iWildCam using MegaDetector derived bounding boxes and center frame weighted stratified sampling across 8 North American species. A generative phenotype editing system produces controlled severity variants depicting hair loss consistent with mange and emaciation. An adaptive scene drift quality control system uses a sham prefilter and decoupled mask then score approach with complementary day or night metrics to reject images where the generative model altered the original scene. We frame the pipeline explicitly as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
