Improving Text Semantic Similarity Modeling through a 3D Siamese Network
Jianxiang Zang, Hui Liu

TL;DR
This paper introduces a 3D Siamese network that enhances text semantic similarity modeling by preserving hierarchical information and enabling more complex downstream analysis, outperforming traditional 2D approaches.
Contribution
The paper proposes a novel 3D Siamese network architecture that maps semantic information into higher-dimensional tensors, retaining more detailed hierarchical features for improved similarity modeling.
Findings
Outperforms traditional 2D Siamese networks on four benchmarks.
Effectively captures hierarchical semantic information.
Demonstrates improved efficiency and effectiveness.
Abstract
Siamese networks have gained popularity as a method for modeling text semantic similarity. Traditional methods rely on pooling operation to compress the semantic representations from Transformer blocks in encoding, resulting in two-dimensional semantic vectors and the loss of hierarchical semantic information from Transformer blocks. Moreover, this limited structure of semantic vectors is akin to a flattened landscape, which restricts the methods that can be applied in downstream modeling, as they can only navigate this flat terrain. To address this issue, we propose a novel 3D Siamese network for text semantic similarity modeling, which maps semantic information to a higher-dimensional space. The three-dimensional semantic tensors not only retains more precise spatial and feature domain information but also provides the necessary structural condition for comprehensive downstream…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Residual Connection · Absolute Position Encodings · Adam · Layer Normalization · Label Smoothing
