A Pseudo-Semantic Loss for Autoregressive Models with Logical   Constraints

Kareem Ahmed; Kai-Wei Chang; Guy Van den Broeck

arXiv:2312.03905·cs.LG·January 30, 2024·1 cites

A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints

Kareem Ahmed, Kai-Wei Chang, Guy Van den Broeck

PDF

Open Access 1 Video

TL;DR

This paper introduces a pseudo-semantic loss method for autoregressive models that efficiently incorporates logical constraints, improving logical consistency and detoxification in language models without computational intractability.

Contribution

It proposes a local, pseudolikelihood-based approximation for enforcing logical constraints in autoregressive models, enabling scalable neuro-symbolic learning.

Findings

01

Enhanced logical consistency in Sudoku and path prediction tasks.

02

Achieved state-of-the-art detoxification of language models against toxic outputs.

03

Improved model alignment with logical constraints without significant computational overhead.

Abstract

Neuro-symbolic AI bridges the gap between purely symbolic and neural approaches to learning. This often requires maximizing the likelihood of a symbolic constraint w.r.t the neural network's output distribution. Such output distributions are typically assumed to be fully-factorized. This limits the applicability of neuro-symbolic learning to the more expressive autoregressive distributions, e.g., transformers. Under such distributions, computing the likelihood of even simple constraints is #P-hard. Instead of attempting to enforce the constraint on the entire output distribution, we propose to do so on a random, local approximation thereof. More precisely, we optimize the likelihood of the constraint under a pseudolikelihood-based approximation centered around a model sample. Our approximation is factorized, allowing the reuse of solutions to sub-problems, a main tenet for efficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints· slideslive

Taxonomy

TopicsNeural Networks and Applications

MethodsBalanced Selection