Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic Study
Yingji Zhang, Marco Valentino, Danilo S. Carvalho, Andr\'e Freitas

TL;DR
This paper systematically explores how explicit reasoning rules can be embedded and disentangled within Transformer-based language VAEs, improving interpretability and knowledge injection in language models.
Contribution
It introduces a complete pipeline for learning and disentangling reasoning rules in language VAEs, supported by a theoretical framework and practical architecture.
Findings
Disentangled reasoning rules form distinct clusters in feature space.
Injecting reasoning info into queries enhances memory retrieval.
Performance plateaus in mathematical reasoning tasks with increased data.
Abstract
Incorporating explicit reasoning rules within the latent space of language models (LMs) offers a promising pathway to enhance generalisation, interpretability, and controllability. While current Transformer-based language models have shown strong performance on Natural Language Inference (NLI) tasks, they often rely on memorisation rather than rule-based inference. This work investigates how reasoning rules can be explicitly embedded and memorised within the LMs through Language Variational Autoencoders (VAEs). We propose a complete pipeline for learning reasoning rules within Transformer-based language VAEs. This pipeline encompasses three rule-based reasoning tasks, a supporting theoretical framework, and a practical end-to-end architecture. The experiment illustrates the following findings: Disentangled reasoning: Under explicit signal supervision, reasoning rules - viewed as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
