Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
Nikil Sharan Prabahar Balasubramanian, Sagnik Dakshit

TL;DR
This paper evaluates the potential of public large language models like GPT-4.0 and Gemini for self-diagnosis of medical conditions, highlighting their performance differences, limitations, and the impact of retrieval-augmented generation.
Contribution
It provides a comparative analysis of GPT-4.0 and Gemini on a large self-diagnosis dataset and discusses their limitations and potential in healthcare applications.
Findings
GPT-4.0 achieved 63.07% accuracy in self-diagnosis.
Gemini achieved only 6.01% accuracy.
Retrieval-augmented generation improves diagnostic performance.
Abstract
Advancements in deep learning have generated a large-scale interest in the development of foundational deep learning models. The development of Large Language Models (LLM) has evolved as a transformative paradigm in conversational tasks, which has led to its integration and extension even in the critical domain of healthcare. With LLMs becoming widely popular and their public access through open-source models and integration with other applications, there is a need to investigate their potential and limitations. One such crucial task where LLMs are applied but require a deeper understanding is that of self-diagnosis of medical conditions based on bias-validating symptoms in the interest of public health. The widespread integration of Gemini with Google search and GPT-4.0 with Bing search has led to a shift in the trend of self-diagnosis using search engines to conversational LLM models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law
MethodsAttention Is All You Need · Dense Connections · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding · Adam · Dropout
