Modeling Selective Feature Attention for Representation-based Siamese   Text Matching

Jianxiang Zang; Hui Liu

arXiv:2404.16776·cs.CL·April 26, 2024

Modeling Selective Feature Attention for Representation-based Siamese Text Matching

Jianxiang Zang, Hui Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Feature Attention and Selective Feature Attention mechanisms for Siamese text matching networks, enhancing feature dependency modeling and semantic extraction, leading to improved performance across benchmarks.

Contribution

The paper proposes novel Feature Attention and Selective Feature Attention modules that dynamically emphasize important features and enable multi-scale semantic extraction in Siamese networks.

Findings

01

Feature Attention improves dependency modeling among features.

02

Selective Feature Attention enhances semantic extraction across abstraction levels.

03

The proposed modules outperform baseline models on multiple benchmarks.

Abstract

Representation-based Siamese networks have risen to popularity in lightweight text matching due to their low deployment and inference costs. While word-level attention mechanisms have been implemented within Siamese networks to improve performance, we propose Feature Attention (FA), a novel downstream block designed to enrich the modeling of dependencies among embedding features. Employing "squeeze-and-excitation" techniques, the FA block dynamically adjusts the emphasis on individual features, enabling the network to concentrate more on features that significantly contribute to the final classification. Building upon FA, we introduce a dynamic "selection" mechanism called Selective Feature Attention (SFA), which leverages a stacked BiGRU Inception structure. The SFA block facilitates multi-scale semantic extraction by traversing different stacked BiGRU layers, encouraging the network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hggzjx/sfa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Topic Modeling · Natural Language Processing Techniques

MethodsFeedback Alignment · Bidirectional GRU