Towards Neural Programming Interfaces
Zachary C. Brown, Nathaniel Robinson, David Wingate, Nancy Fulda

TL;DR
This paper introduces Neural Programming Interfaces (NPIs), specialized neural networks that control pretrained language models' outputs by manipulating internal activations, enabling task-specific control without altering original model weights.
Contribution
It proposes a novel NPI framework, a new data set construction algorithm, and a GAN-inspired loss function for controlling autoregressive transformers.
Findings
Successfully controlled noun selection, topic aversion, and offensive speech filtering.
Maintained language fluency while controlling outputs.
Demonstrated effectiveness against state-of-the-art methods.
Abstract
It is notoriously difficult to control the behavior of artificial neural networks such as generative neural language models. We recast the problem of controlling natural language generation as that of learning to interface with a pretrained language model, just as Application Programming Interfaces (APIs) control the behavior of programs by altering hyperparameters. In this new paradigm, a specialized neural network (called a Neural Programming Interface or NPI) learns to interface with a pretrained language model by manipulating the hidden activations of the pretrained model to produce desired outputs. Importantly, no permanent changes are made to the weights of the original model, allowing us to re-purpose pretrained models for new tasks without overwriting any aspect of the language model. We also contribute a new data set construction algorithm and GAN-inspired loss function that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Speech Recognition and Synthesis
MethodsLinear Layer · Cosine Annealing · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Dropout · Attention Is All You Need · Linear Warmup With Cosine Annealing · Byte Pair Encoding · Layer Normalization · Dense Connections
