BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia, Pengyuan Wang, Ziniu Li, Yi-Chen Li, Zhilong Zhang, Nan, Tang, Yang Yu

TL;DR
The BWArea model introduces a brain-inspired decision-making framework for controllable language generation, combining a world model, inverse dynamics, and policy to improve performance and alignment with downstream tasks.
Contribution
It proposes a novel decision-making based architecture for language models, inspired by human brain mechanisms, enhancing controllability and robustness against data noise.
Findings
Achieves competitive performance with 1B parameter LLMs using 30B tokens.
Maintains performance with dirty data, unlike auto-regressive models.
Outperforms auto-regressive LLMs on 9 out of 10 tasks in benchmarks.
Abstract
Large language models (LLMs) have catalyzed a paradigm shift in natural language processing, yet their limited controllability poses a significant challenge for downstream applications. We aim to address this by drawing inspiration from the neural mechanisms of the human brain, specifically Broca's and Wernicke's areas, which are crucial for language generation and comprehension, respectively. In particular, Broca's area receives cognitive decision signals from Wernicke's area, treating the language generation as an intricate decision-making process, which differs from the fully auto-regressive language generation of existing LLMs. In a similar vein, our proposed system, the BWArea model, conceptualizes language generation as a decision-making task. This model has three components: a language world model, an inverse dynamics model, and a cognitive policy. Like Wernicke's area, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Language and cultural evolution
