Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
Shibing Mo, Haoyang Ruan, Kai Wu, Jing Liu

TL;DR
This paper introduces TSAN, a parameter-free, text-based self-attention mechanism that optimizes language model responses at test time by analyzing, weighing, and synthesizing multiple candidates to better align with human preferences.
Contribution
TSAN is a novel, parameter-free test-time preference optimization method that emulates self-attention in natural language, enabling effective synthesis of multiple candidate responses without fine-tuning.
Findings
TSAN outperforms supervised models with just three test-time iterations.
TSAN surpasses existing test-time alignment methods.
The method effectively leverages multiple candidate responses for better alignment.
Abstract
Large Language Models (LLMs) have demonstrated remarkable generalization capabilities, but aligning their outputs with human preferences typically requires expensive supervised fine-tuning. Recent test-time methods leverage textual feedback to overcome this, but they often critique and revise a single candidate response, lacking a principled mechanism to systematically analyze, weigh, and synthesize the strengths of multiple promising candidates. Such a mechanism is crucial because different responses may excel in distinct aspects (e.g., clarity, factual accuracy, or tone), and combining their best elements may produce a far superior outcome. This paper proposes the Textual Self-Attention Network (TSAN), a new paradigm for test-time preference optimization that requires no parameter updates. TSAN emulates self-attention entirely in natural language to overcome this gap: it analyzes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks
