From Text to Image: Exploring GPT-4Vision's Potential in Advanced Radiological Analysis across Subspecialties
Felix Busch, Tianyu Han, Marcus Makowski, Daniel Truhn, Keno Bressem, Lisa Adams

TL;DR
This paper investigates GPT-4Vision's ability to analyze radiological images, demonstrating its potential to improve diagnostic accuracy across various subspecialties compared to text-only models.
Contribution
It introduces GPT-4Vision as a multimodal model capable of recognizing radiological features directly from images, advancing AI applications in medical diagnostics.
Findings
GPT-4Vision can recognize radiological features from images.
It outperforms text-based GPT-4 in diagnostic tasks.
Potential to enhance radiological analysis across subspecialties.
Abstract
The study evaluates and compares GPT-4 and GPT-4Vision for radiological tasks, suggesting GPT-4Vision may recognize radiological features from images, thereby enhancing its diagnostic potential over text-based descriptions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Radiology practices and education · Topic Modeling
MethodsAttention Is All You Need · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Absolute Position Encodings · Layer Normalization · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing
