DGA-Net Dynamic Gaussian Attention Network for Sentence Semantic   Matching

Kun Zhang; Guangyi Lv; Meng Wang; and Enhong Chen

arXiv:2106.04905·cs.CL·June 10, 2021

DGA-Net Dynamic Gaussian Attention Network for Sentence Semantic Matching

Kun Zhang, Guangyi Lv, Meng Wang, and Enhong Chen

PDF

Open Access

TL;DR

This paper introduces DGA-Net, a novel attention model that combines static and dynamic attention mechanisms to improve sentence semantic matching by capturing both global and local contextual information.

Contribution

The paper proposes a Dynamic Gaussian Attention (DGA) mechanism integrated into a new network, enhancing attention effectiveness in sentence semantic matching tasks.

Findings

01

DGA-Net outperforms existing models on benchmark datasets.

02

The model effectively captures both global and local semantic information.

03

Experimental results show significant accuracy improvements.

Abstract

Sentence semantic matching requires an agent to determine the semantic relation between two sentences, where much recent progress has been made by the advancement of representation learning techniques and inspiration of human behaviors. Among all these methods, attention mechanism plays an essential role by selecting important parts effectively. However, current attention methods either focus on all the important parts in a static way or only select one important part at one attention step dynamically, which leaves a large space for further improvement. To this end, in this paper, we design a novel Dynamic Gaussian Attention Network (DGA-Net) to combine the advantages of current static and dynamic attention methods. More specifically, we first leverage pre-trained language model to encode the input sentences and construct semantic representations from a global perspective. Then, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications