# Data augmentation method for computer-aided diagnosis using specular reflection

**Authors:** Youmin Shin, Jeonga Seol, Changwoo Lee, Jung Kim, Jinwook Choi, Jinbae Park, Soonwhan Kang, Gyuseon Song, Jung Ho Bae, Young-Gon Kim

PMC · DOI: 10.1007/s13534-025-00533-0 · 2026-01-13

## TL;DR

This paper introduces a new data augmentation method using specular reflection to improve deep learning models for colon polyp diagnosis during colonoscopies.

## Contribution

The novel contribution is a specular reflection-based data augmentation technique that enhances model performance, especially with limited training data.

## Key findings

- The SR augmentation method improved model accuracy in limited data scenarios.
- SR augmentation showed robustness across different deep learning architectures.
- The method outperformed conventional augmentation techniques in stress tests.

## Abstract

Colorectal cancer (CRC) is a significant global health challenge, emphasizing the importance of effective screening by applying methods like colonoscopy. While advanced imaging technologies, such as narrow-band imaging (NBI), allow real-time optical diagnosis of colon polyps, variations in endoscopist skills and unnecessary polypectomy underscore the need for artificial intelligence applications, particularly deep learning (DL) in computer-aided polyp detection and diagnosis (CADe and CADx). This study developed and investigated a data augmentation technique using specular reflection (SR) to enhance the robustness and performance of DL models tailored explicitly for CADx in colonoscopy. This SR augmentation method included SR generation and inpainting integrated into conventional augmentation techniques. We utilized two DL architectures: a convolutional neural network and a vision transformer. Stress tests, under varying data usage ratios using a dataset of 2,616 NBI images, revealed the robustness of SR augmentation, especially in scenarios with limited training data, highlighting its superiority over other methods. SR augmentation effectively improved model accuracy, particularly in scenarios with limited data, supporting its practical implementation in real-world colonoscopy environments. The findings emphasize the significance of domain-specific data augmentation techniques to support DL application in colonoscopy imaging for more reliable and accurate CADx systems for colon polyps.

The online version contains supplementary material available at 10.1007/s13534-025-00533-0.

## Linked entities

- **Diseases:** colorectal cancer (MONDO:0005575)

## Full-text entities

- **Diseases:** polyp (MESH:D011127), colon polyps (MESH:D003111), CRC (MESH:D015179)

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13013768/full.md

---
Source: https://tomesphere.com/paper/PMC13013768