Imaginations of WALL-E : Reconstructing Experiences with an Imagination-Inspired Module for Advanced AI Systems
Zeinab Sadat Taghavi, Soroush Gooran, Seyed Arshan Dalili, Hamidreza, Amirzadeh, Mohammad Jalal Nematbakhsh, Hossein Sameti

TL;DR
This paper presents an imagination-inspired AI system that reconstructs experiences to generate interpretable, multimodal perceptions, outperforming existing large language models in emotion recognition and question-answering tasks.
Contribution
The work introduces a novel imagination-inspired module for AI that enables independent, interpretable perceptions and cross-modal information synthesis, advancing beyond traditional language processing models.
Findings
Outperformed existing LLMs on MELD, IEMOCAP, and CoQA datasets.
Achieved higher F1 scores indicating better emotion recognition and QA performance.
Demonstrated the system's ability to generate deep, interpretable multimodal information.
Abstract
In this paper, we introduce a novel Artificial Intelligence (AI) system inspired by the philosophical and psychoanalytical concept of imagination as a ``Re-construction of Experiences". Our AI system is equipped with an imagination-inspired module that bridges the gap between textual inputs and other modalities, enriching the derived information based on previously learned experiences. A unique feature of our system is its ability to formulate independent perceptions of inputs. This leads to unique interpretations of a concept that may differ from human interpretations but are equally valid, a phenomenon we term as ``Interpretable Misunderstanding". We employ large-scale models, specifically a Multimodal Large Language Model (MLLM), enabling our proposed system to extract meaningful information across modalities while primarily remaining unimodal. We evaluated our system against other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods
