Toward IIT-Inspired Consciousness in LLMs: A Reward-Based Learning Framework
Hamid Reza Akbari, Mohammad Hossein Sameti, Amir M. Mansourian, Mohammad Hossein Rohban, Hossein Sameti

TL;DR
This paper introduces a reward-based learning framework inspired by Integrated Information Theory to imbue language models with consciousness-like qualities, leading to more concise outputs and improved calibration without external data.
Contribution
It formulates a novel IIT-inspired reward function for language models, demonstrating practical benefits like reduced output length and maintained accuracy through a simple, efficient training approach.
Findings
Up to 31% reduction in output length
Maintained accuracy on out-of-domain tasks
Improved confidence calibration
Abstract
The pursuit of Artificial General Intelligence (AGI) is a central goal in language model development, in which consciousness-like processing could serve as a key facilitator. While current language models are not conscious, they exhibit behaviors analogous to certain aspects of consciousness. This paper investigates the implementation of a leading theory of consciousness, Integrated Information Theory (IIT), within language models via a reward-based learning paradigm. IIT provides a formal, axiom-based mathematical framework for quantifying consciousness. Drawing inspiration from its core principles, we formulate a novel reward function that quantifies a text's causality, coherence and integration, characteristics associated with conscious processing. Empirically, it is found that optimizing for this IIT-inspired reward leads to more concise text generation. On out of domain tasks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Computing and Networks · Neurobiology of Language and Bilingualism · Embodied and Extended Cognition
