BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching
Ehsan Tavan, Ali Rahmati, Maryam Najafi, Saeed Bibak, Zahed Rahmati

TL;DR
This paper introduces BERT-DRE, a deep recursive encoder added to BERT, which enhances natural language sentence matching by capturing more complex text features, outperforming BERT on multiple benchmarks.
Contribution
The paper proposes a novel deep recursive encoder architecture on top of BERT, improving its performance in sentence matching tasks across various datasets.
Findings
BERT-DRE outperforms BERT on all tested benchmarks.
Accuracy on religious dataset improved from 89.70% to 90.29%.
Deep recursive encoding enhances BERT's ability to capture text complexity.
Abstract
This paper presents a deep neural architecture, for Natural Language Sentence Matching (NLSM) by adding a deep recursive encoder to BERT so called BERT with Deep Recursive Encoder (BERT-DRE). Our analysis of model behavior shows that BERT still does not capture the full complexity of text, so a deep recursive encoder is applied on top of BERT. Three Bi-LSTM layers with residual connection are used to design a recursive encoder and an attention module is used on top of this encoder. To obtain the final vector, a pooling layer consisting of average and maximum pooling is used. We experiment our model on four benchmarks, SNLI, FarsTail, MultiNLI, SciTail, and a novel Persian religious questions dataset. This paper focuses on improving the BERT results in the NLSM task. In this regard, comparisons between BERT-DRE and BERT are conducted, and it is shown that in all cases, BERT-DRE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsAttention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Multi-Head Attention · Layer Normalization · Softmax · Weight Decay · WordPiece · Adam
