Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset
Abbi Ward, Jimmy Li, Julie Wang, Sriram Lakshminarasimhan, Ashley, Carrick, Bilson Campana, Jay Hartford, Pradeep Kumar S, Tiya, Tiyasirichokchai, Sunny Virmani, Renee Wong, Yossi Matias, Greg S. Corrado,, Dale R. Webster, Dawn Siegel, Steven Lin, Justin Ko

TL;DR
This study demonstrates that Google Search ads can effectively crowdsource a diverse, real-world dermatology image dataset, which includes demographic and condition labels, aiding AI and medical research.
Contribution
Introduces a scalable method using search ads to create a large, diverse dermatology image dataset with demographic and condition annotations.
Findings
Over 10,000 images collected from 5,033 contributors
Dataset shows demographic diversity and real-world condition representation
Dermatologist diagnosis confidence improves with more data variables
Abstract
Background: Health datasets from clinical sources do not reflect the breadth and diversity of disease in the real world, impacting research, medical education, and artificial intelligence (AI) tool development. Dermatology is a suitable area to develop and test a new and scalable method to create representative health datasets. Methods: We used Google Search advertisements to invite contributions to an open access dataset of images of dermatology conditions, demographic and symptom information. With informed contributor consent, we describe and release this dataset containing 10,408 images from 5,033 contributions from internet users in the United States over 8 months starting March 2023. The dataset includes dermatologist condition labels as well as estimated Fitzpatrick Skin Type (eFST) and Monk Skin Tone (eMST) labels for the images. Results: We received a median of 22…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · Data-Driven Disease Surveillance · Body Image and Dysmorphia Studies
