Quantifying Similarity: Text-Mining Approaches to Evaluate ChatGPT and Google Bard Content in Relation to BioMedical Literature
Jakub Klimczak, Ahmed Abdeen Hamed

TL;DR
This study compares ChatGPT and Google Bard's generated biomedical content to real literature using text-mining and network analysis, finding ChatGPT produces more similar and interconnected content, potentially aiding hypothesis generation.
Contribution
It introduces a novel text-mining approach to evaluate and compare AI-generated biomedical content with real literature, highlighting ChatGPT's superior similarity metrics.
Findings
ChatGPT has higher cosine and Jaccard similarity scores than Google Bard.
ChatGPT's bigram networks reveal new links not present in literature.
ChatGPT's content shows better centrality measures, indicating more meaningful connections.
Abstract
Background: The emergence of generative AI tools, empowered by Large Language Models (LLMs), has shown powerful capabilities in generating content. To date, the assessment of the usefulness of such content, generated by what is known as prompt engineering, has become an interesting research question. Objectives Using the mean of prompt engineering, we assess the similarity and closeness of such contents to real literature produced by scientists. Methods In this exploratory analysis, (1) we prompt-engineer ChatGPT and Google Bard to generate clinical content to be compared with literature counterparts, (2) we assess the similarities of the contents generated by comparing them with counterparts from biomedical literature. Our approach is to use text-mining approaches to compare documents and associated bigrams and to use network analysis to assess the terms' centrality. Results The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
