Inclusivity in Large Language Models: Personality Traits and Gender Bias in Scientific Abstracts
Naseela Pervez, Alexander J. Titus

TL;DR
This study evaluates three large language models for scientific abstract generation, revealing they often mimic human styles but exhibit gender biases, underscoring the need for more inclusive AI writing tools.
Contribution
It provides an analysis of gender bias and stylistic variation in LLM-generated scientific abstracts using the LIWC framework, highlighting biases in current models.
Findings
Models produce human-like scientific texts
Significant gender biases are present in generated content
Stylistic variations suggest biases in LLM outputs
Abstract
Large language models (LLMs) are increasingly utilized to assist in scientific and academic writing, helping authors enhance the coherence of their articles. Previous studies have highlighted stereotypes and biases present in LLM outputs, emphasizing the need to evaluate these models for their alignment with human narrative styles and potential gender biases. In this study, we assess the alignment of three prominent LLMs - Claude 3 Opus, Mistral AI Large, and Gemini 1.5 Flash - by analyzing their performance on benchmark text-generation tasks for scientific abstracts. We employ the Linguistic Inquiry and Word Count (LIWC) framework to extract lexical, psychological, and social features from the generated texts. Our findings indicate that, while these models generally produce text closely resembling human authored content, variations in stylistic features suggest significant gender…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Computational and Text Analysis Methods · Hate Speech and Cyberbullying Detection
