Perturb-and-Restore: Simulation-driven Structural Augmentation Framework for Imbalance Chromosomal Anomaly Detection
Yilan Zhang, Hanbiao Chen, Changchun Yang, Yuetan Chu, Siyuan Chen, Jing Wu, Jingdong Hu, Na Li, Junkai Su, Yuxuan Chen, Ao Xu, Xin Gao, Aihua Yin

TL;DR
This paper introduces a simulation-driven augmentation framework called Perturb-and-Restore to improve chromosomal anomaly detection by generating synthetic abnormal data, addressing data scarcity and imbalance issues.
Contribution
The novel Perturb-and-Restore framework combines structure perturbation, restoration simulation, and adaptive sampling to enhance deep learning performance on imbalanced chromosome anomaly datasets.
Findings
Achieved state-of-the-art detection performance with over 260,000 chromosome images.
Surpassed existing methods by approximately 9% in sensitivity and 14% in F1-score.
Effectively alleviated data imbalance with synthetic abnormal chromosome generation.
Abstract
Detecting structural chromosomal abnormalities is crucial for accurate diagnosis and management of genetic disorders. However, collecting sufficient structural abnormality data is extremely challenging and costly in clinical practice, and not all abnormal types can be readily collected. As a result, deep learning approaches face significant performance degradation due to the severe imbalance and scarcity of abnormal chromosome data. To address this challenge, we propose a Perturb-and-Restore (P&R), a simulation-driven structural augmentation framework that effectively alleviates data imbalance in chromosome anomaly detection. The P&R framework comprises two key components: (1) Structure Perturbation and Restoration Simulation, which generates synthetic abnormal chromosomes by perturbing chromosomal banding patterns of normal chromosomes followed by a restoration diffusion network that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
