# Multimodal contrastive learning for non-invasive chondroid bone tumor classification and grading using radiographs

**Authors:** Peiying Hua, Jessica M. Sin, Eric R. Henderson, Jason Ha, Darcy A. Kerr, Saeed Hassanpour

PMC · DOI: 10.1186/s12880-026-02239-w · 2026-02-20

## TL;DR

This study uses AI to classify and grade chondroid bone tumors from radiographs, offering a non-invasive alternative to histology.

## Contribution

A novel contrastive learning framework that integrates radiographic and histological data for tumor classification and grading.

## Key findings

- A radiograph-based model achieved an AUC of 0.91 for tumor classification.
- Contrastive learning improved grading performance from AUC 0.86 to 0.95 using radiographs alone.
- Multimodal integration achieved perfect discrimination (AUC = 1.00) when combining radiographs and histology.

## Abstract

Diagnosing and grading chondroid bone tumors with radiography is difficult due to overlapping features with benign conditions such as avascular necrosis and fibrous dysplasia. Histology remains the gold standard, but it is invasive, costly, and not always accessible. This pilot study introduces a contrastive learning framework to address two challenges: (1) developing an AI system for accurate tumor classification and grading using radiographs alone, and (2) creating an enhanced multimodal pipeline when histology is available.

We retrospectively analyzed radiographs from 188 patients and histology images from 63 patients at a tertiary academic medical center in the United States. The stepwise framework included: (1) unimodal classification, (2) contrastive learning to align cross-modal embeddings, and (3) classification using enhanced representations. Models were trained with 5-fold cross-validation and evaluated using AUC, accuracy, sensitivity, and specificity.

Our framework demonstrates promising performance across all tasks. The radiograph-based model achieved an AUC of 0.91 (95%CI: 0.82–1.00) in distinguishing tumors from avascular necrosis and fibrous dysplasia. For grading, contrastive learning improved radiograph-only performance from AUC 0.86 to 0.95 (95%CI: 0.85–1.00). The histology-only model improved from AUC 0.73 to 0.83 with contrastive enhancement. Multimodal integration achieved perfect discrimination (AUC = 1.00) on the available subset.

This study establishes proof-of-concept that contrastive learning can effectively bridge radiographic and histological representations for tumor assessment. The framework offers clinical potential by enabling non-invasive classification in resource-limited settings while allowing multimodal enhancement. These results warrant validation in larger, multi-institutional cohorts.

The online version contains supplementary material available at 10.1186/s12880-026-02239-w.

## Linked entities

- **Diseases:** avascular necrosis (MONDO:0018373), fibrous dysplasia (MONDO:0000845)

## Full-text entities

- **Diseases:** chondroid bone tumor (MESH:D001859)

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC13032380/full.md

---
Source: https://tomesphere.com/paper/PMC13032380