Zero-Shot Controlled Generation with Encoder-Decoder Transformers
Devamanyu Hazarika, Mahdi Namazifar, Dilek Hakkani-T\"ur

TL;DR
This paper introduces three novel zero-shot control mechanisms for encoder-decoder transformer-based natural language generation models, enabling attribute manipulation without additional training or data, while maintaining performance.
Contribution
It presents three innovative control knobs—attention biasing, decoder mixing, and context augmentation—that manipulate trained models at generation time for zero-shot control.
Findings
Control knobs effectively manipulate output attributes.
Model robustness to control manipulations demonstrated.
Insights into transformer decoder's role in fluency.
Abstract
Controlling neural network-based models for natural language generation (NLG) has broad applications in numerous areas such as machine translation, document summarization, and dialog systems. Approaches that enable such control in a zero-shot manner would be of great importance as, among other reasons, they remove the need for additional annotated data and training. In this work, we propose novel approaches for controlling encoder-decoder transformer-based NLG models in zero-shot. This is done by introducing three control knobs, namely, attention biasing, decoder mixing, and context augmentation, that are applied to these models at generation time. These knobs control the generation process by directly manipulating trained NLG models (e.g., biasing cross-attention layers) to realize the desired attributes in the generated outputs. We show that not only are these NLG models robust to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
