Demographic Predictability in 3D CT Foundation Embeddings
Guangyao Zheng, Michael A. Jacobs, Vishwa S. Parekh

TL;DR
This study evaluates whether 3D CT embeddings from self-supervised models encode demographic information, finding they effectively capture age and sex but less so race, highlighting implications for fairness and privacy in healthcare AI.
Contribution
It demonstrates that self-supervised 3D CT embeddings encode demographic data like age and sex, raising awareness of potential biases and privacy concerns in medical AI applications.
Findings
Embeddings predict age with RMSE of 3.8 years.
Embeddings classify sex with AUC of 0.998.
Race prediction is less accurate with AUC of 0.878.
Abstract
Self-supervised foundation models have recently been successfully extended to encode three-dimensional (3D) computed tomography (CT) images, with excellent performance across several downstream tasks, such as intracranial hemorrhage detection and lung cancer risk forecasting. However, as self-supervised models learn from complex data distributions, questions arise concerning whether these embeddings capture demographic information, such as age, sex, or race. Using the National Lung Screening Trial (NLST) dataset, which contains 3D CT images and demographic data, we evaluated a range of classifiers: softmax regression, linear regression, linear support vector machine, random forest, and decision tree, to predict sex, race, and age of the patients in the images. Our results indicate that the embeddings effectively encoded age and sex information, with a linear regression model achieving a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Statistical Methods and Inference
MethodsLinear Regression · Softmax
