S2Sent: Nested Selectivity Aware Sentence Representation Learning

Jianxiang Zang; Nijia Mo; Yonda Wei; Meiling Ning; Hui Liu

arXiv:2508.18164·cs.CL·August 26, 2025

S2Sent: Nested Selectivity Aware Sentence Representation Learning

Jianxiang Zang, Nijia Mo, Yonda Wei, Meiling Ning, Hui Liu

PDF

TL;DR

S2Sent introduces a novel sentence representation method that adaptively fuses multi-block Transformer features using a nested selector, improving semantic encoding efficiency and performance.

Contribution

The paper proposes S2Sent, a nested selectivity mechanism for Transformer encoders, enhancing sentence representations by balancing semantic redundancy and information loss.

Findings

01

Achieves significant performance improvements over baseline methods.

02

Maintains low additional parameters and inference latency.

03

Demonstrates high scalability and integrability.

Abstract

The combination of Transformer-based encoders with contrastive learning represents the current mainstream paradigm for sentence representation learning. This paradigm is typically based on the hidden states of the last Transformer block of the encoder. However, within Transformer-based encoders, different blocks exhibit varying degrees of semantic perception ability. From the perspective of interpretability, the semantic perception potential of knowledge neurons is modulated by stimuli, thus rational cross-block representation fusion is a direction worth optimizing. To balance the semantic redundancy and loss across block fusion, we propose a sentence representation selection mechanism S\textsuperscript{2}Sent, which integrates a parameterized nested selector downstream of the Transformer-based encoder. This selector performs spatial selection (SS) and nested frequency selection (FS)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.