Syntactic Inductive Biases for Deep Learning Methods

Yikang Shen

arXiv:2206.04806·cs.LG·June 13, 2022

Syntactic Inductive Biases for Deep Learning Methods

Yikang Shen

PDF

Open Access

TL;DR

This thesis introduces syntactic inductive biases for deep learning, enabling models to better capture hierarchical and relational structures in data, improving interpretability and performance.

Contribution

It proposes two novel inductive biases for deep learning models—one for constituency and one for dependency structures—enhancing their ability to learn hierarchical and relational representations.

Findings

01

Models learn to process logical expressions based on syntactic structure.

02

Dependency bias enables models to induce graphs close to human annotations.

03

Biases improve performance on various natural language tasks.

Abstract

In this thesis, we try to build a connection between the two schools by introducing syntactic inductive biases for deep learning models. We propose two families of inductive biases, one for constituency structure and another one for dependency structure. The constituency inductive bias encourages deep learning models to use different units (or neurons) to separately process long-term and short-term information. This separation provides a way for deep learning models to build the latent hierarchical representations from sequential inputs, that a higher-level representation is composed of and can be decomposed into a series of lower-level representations. For example, without knowing the ground-truth structure, our proposed model learns to process logical expression through composing representations of variables and operators into representations of expressions according to its syntactic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications