S2Sent: Nested Selectivity Aware Sentence Representation Learning
Jianxiang Zang, Nijia Mo, Yonda Wei, Meiling Ning, Hui Liu

TL;DR
S2Sent introduces a novel sentence representation method that adaptively fuses multi-block Transformer features using a nested selector, improving semantic encoding efficiency and performance.
Contribution
The paper proposes S2Sent, a nested selectivity mechanism for Transformer encoders, enhancing sentence representations by balancing semantic redundancy and information loss.
Findings
Achieves significant performance improvements over baseline methods.
Maintains low additional parameters and inference latency.
Demonstrates high scalability and integrability.
Abstract
The combination of Transformer-based encoders with contrastive learning represents the current mainstream paradigm for sentence representation learning. This paradigm is typically based on the hidden states of the last Transformer block of the encoder. However, within Transformer-based encoders, different blocks exhibit varying degrees of semantic perception ability. From the perspective of interpretability, the semantic perception potential of knowledge neurons is modulated by stimuli, thus rational cross-block representation fusion is a direction worth optimizing. To balance the semantic redundancy and loss across block fusion, we propose a sentence representation selection mechanism S\textsuperscript{2}Sent, which integrates a parameterized nested selector downstream of the Transformer-based encoder. This selector performs spatial selection (SS) and nested frequency selection (FS)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
