On the Low-Rank Parametrization of Reward Models for Controlled Language Generation
Sergey Troshin, Vlad Niculae, Antske Fokkens

TL;DR
This paper introduces a low-rank parametrization for external expert models guiding language generation, achieving comparable performance to higher-rank models with improved efficiency and reduced computational costs.
Contribution
It proposes a low-rank expert model for reward-guided decoding that maintains effectiveness while significantly reducing computational complexity.
Findings
Low-rank RAD matches high-rank RAD performance on detoxification and sentiment tasks.
Low-rank expert requires only one reward model call per token, enhancing efficiency.
Low-rank approach simplifies reward-augmented decoding without sacrificing quality.
Abstract
Language models trained on large amounts of data are known to produce inappropriate content in some cases and require careful tuning to be used in the real world. We revisit an effective and modular approach for controllability of the language models, when an external expert model guides the decoding. Particularly, we zoom in into the parametrization choice of an external expert, highlighting the difference between low-rank and higher-rank parametrizations. Higher-rank experts are designed to support high flexibility when representing the rewards, leading to higher computational costs during decoding. However, we demonstrate that they might not use their full flexibility. By analyzing the recently proposed reward-augmented decoding approach (RAD), which uses a higher-rank expert model, we introduce a simpler but more efficient low-rank parametrization of the expert model enabling fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Image and Video Retrieval Techniques · Advanced Data Compression Techniques
MethodsBalanced Selection
