Tree Matching Networks for Natural Language Inference: Parameter-Efficient Semantic Understanding via Dependency Parse Trees
Jason Lunder

TL;DR
This paper introduces Tree Matching Networks, a parameter-efficient model leveraging dependency parse trees for natural language inference, outperforming transformer-based models in efficiency and comparable in accuracy.
Contribution
The paper proposes Tree Matching Networks that incorporate dependency parse trees into a graph matching framework, improving learning efficiency and reducing resource requirements for NLI tasks.
Findings
TMN achieves better accuracy than BERT on SNLI with less memory and training time.
Explicit structural representations outperform sequence-based models at similar scales.
Multi-headed attention aggregation improves scalability of structural models.
Abstract
In creating sentence embeddings for Natural Language Inference (NLI) tasks, using transformer-based models like BERT leads to high accuracy, but require hundreds of millions of parameters. These models take in sentences as a sequence of tokens, and learn to encode the meaning of the sequence into embeddings such that those embeddings can be used reliably for NLI tasks. Essentially, every word is considered against every other word in the sequence, and the transformer model is able to determine the relationships between them, entirely from scratch. However, a model that accepts explicit linguistic structures like dependency parse trees may be able to leverage prior encoded information about these relationships, without having to learn them from scratch, thus improving learning efficiency. To investigate this, we adapt Graph Matching Networks (GMN) to operate on dependency parse trees,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Graph Neural Networks
