RadGenome-Anatomy: A Large-Scale Anatomy-Labeled Chest Radiograph Dataset via Physically Grounded Volumetric Projection
Shuchang Ye,Mingyuan Meng,Hao Wang,Usman Naseem,Jinman Kim

TL;DR
This paper introduces RadGenome-Anatomy, the largest annotated chest radiograph dataset created by projecting 3D anatomical masks from CT scans into 2D radiographic space, enabling improved medical image analysis.
Contribution
It presents a novel large-scale dataset with physically grounded annotations, shifting from 2D tracing to volumetric projection for better anatomical labeling.
Findings
Over 10 million segmentation masks across 210 structures
Achieved high diagnostic accuracy for cardiomegaly, kyphosis, and scoliosis
Enables research on geometric measurements for radiograph interpretation
Abstract
Anatomical structure labels for chest radiographs are essential for medical image segmentation and a broad range of downstream diagnostic tasks. However, annotating anatomy directly on 2D chest radiographs is labor-intensive and intrinsically ambiguous, as 3D anatomical structures are projected onto a single 2D plane where boundaries may overlap, be occluded, or appear only partially visible. Consequently, existing anatomy-labeled chest radiograph datasets remain limited in scale, anatomy coverage, and label reliability. To address these limitations, we introduce RadGenome-Anatomy, the largest anatomy-labeled chest radiograph dataset, containing over 10 million segmentation masks across 210 anatomical structures in 25,692 studies. It is constructed by projecting large-scale 3D anatomical masks from CT volumes into 2D radiographic space through canonical radiographic geometry. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
