# Evaluation of the Applicability of an Overseas-Trained Artificial Intelligence System for Mammography Interpretation in Japan

**Authors:** Maya Makita, Kouzou Murakami, Wakana Murakami, Hiroko Takamatsu, Kanai Takahiro, Atsuhito Sekimoto, Yoshinori Ito, Yoshimitsu Ohgiya

PMC · DOI: 10.7759/cureus.101466 · Cureus · 2026-01-13

## TL;DR

This study evaluates if an AI system trained outside Japan can be used for breast cancer detection in Japanese mammograms, finding it performs reasonably well but needs further validation.

## Contribution

The study demonstrates the potential feasibility of using an overseas-trained AI system for mammography interpretation in Japan.

## Key findings

- The AI-CAD achieved 79% sensitivity and 89% specificity on Japanese validation data.
- AI assistance did not significantly improve radiologists' diagnostic performance in the reader study.
- The AI system's performance was comparable to unaided readings, suggesting potential applicability in Japan.

## Abstract

Background: Breast cancer remains a major public health issue in Japan, and artificial intelligence (AI)-based computer-aided detection (CAD) systems have the potential to enhance diagnostic performance. We conducted a two-phase evaluation of an AI-CAD trained on non-Japanese data: an external validation using Japanese mammography images and a reader study assessing its impact on diagnostic performance.

Methods: We performed an external validation to evaluate the diagnostic performance of a commercial AI-CAD system using full-field digital mammography (FFDM) images obtained from Japanese patients. This study primarily focused on evaluating the standalone diagnostic performance of an AI-CAD system using a validation cohort of 338 Japanese patients. To further assess its practical utility, a supplementary multi-reader study with 40 selected cases was conducted to observe the interaction between radiologists and AI output. The AI-CAD was developed and trained outside Japan. Diagnostic performance was assessed using sensitivity, specificity, and receiver operating characteristic curve analysis.

Results: On validation data, the AI-CAD achieved a sensitivity of 79%, specificity of 89%, and an area under the curve (AUC) of 0.897 (95% CI 0.860-0.934). In a reader study of 40 cases, their performance improved from an AUC of 0.750 to 0.756 (Breast Imaging Reporting and Data System (BI-RADS); p=0.505) and from 0.750 to 0.761 (Likelihood of Malignancy; p=0.110) when assisted by AI-CAD.

Conclusions: Although no statistically significant difference was observed, AI‐aided readings yielded AUCs comparable to AI-unaided readings (95% CI overlap); these findings suggest the feasibility of applying an AI‑CAD trained outside Japan to Japanese cases, while larger prospective screening studies are required to establish clinical impact.

## Linked entities

- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Diseases:** Breast cancer (MESH:D001943)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12900558/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12900558/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/PMC12900558/full.md

---
Source: https://tomesphere.com/paper/PMC12900558