A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints
Kareem Ahmed, Kai-Wei Chang, Guy Van den Broeck

TL;DR
This paper introduces a pseudo-semantic loss method for autoregressive models that efficiently incorporates logical constraints, improving logical consistency and detoxification in language models without computational intractability.
Contribution
It proposes a local, pseudolikelihood-based approximation for enforcing logical constraints in autoregressive models, enabling scalable neuro-symbolic learning.
Findings
Enhanced logical consistency in Sudoku and path prediction tasks.
Achieved state-of-the-art detoxification of language models against toxic outputs.
Improved model alignment with logical constraints without significant computational overhead.
Abstract
Neuro-symbolic AI bridges the gap between purely symbolic and neural approaches to learning. This often requires maximizing the likelihood of a symbolic constraint w.r.t the neural network's output distribution. Such output distributions are typically assumed to be fully-factorized. This limits the applicability of neuro-symbolic learning to the more expressive autoregressive distributions, e.g., transformers. Under such distributions, computing the likelihood of even simple constraints is #P-hard. Instead of attempting to enforce the constraint on the entire output distribution, we propose to do so on a random, local approximation thereof. More precisely, we optimize the likelihood of the constraint under a pseudolikelihood-based approximation centered around a model sample. Our approximation is factorized, allowing the reuse of solutions to sub-problems, a main tenet for efficiently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications
MethodsBalanced Selection
