Metacognitive Myopia in Large Language Models
Florian Scholten, Tobias R. Rebholz, and Mandy H\"utter

TL;DR
This paper introduces a cognitive-ecological framework called metacognitive myopia to explain and address biases in large language models, emphasizing the lack of monitoring and control components as root causes.
Contribution
It proposes a new theoretical framework for understanding LLM biases and suggests incorporating metacognitive regulation to mitigate these issues.
Findings
Identifies five symptoms of metacognitive myopia in LLMs.
Highlights the role of monitoring and control deficits in bias formation.
Suggests regulatory processes can improve LLM reliability.
Abstract
Large Language Models (LLMs) exhibit potentially harmful biases that reinforce culturally inherent stereotypes, cloud moral judgments, or amplify positive evaluations of majority groups. Previous explanations mainly attributed bias in LLMs to human annotators and the selection of training data. Consequently, they have typically been addressed with bottom-up approaches such as reinforcement learning or debiasing corpora. However, these methods only treat the effects of LLM biases by indirectly influencing the model architecture, but do not address the underlying causes in the computational process. Here, we propose metacognitive myopia as a cognitive-ecological framework that can account for a conglomerate of established and emerging LLM biases and provide a lever to address problems in powerful but vulnerable tools. Our theoretical framework posits that a lack of the two components of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems
MethodsBalanced Selection
