Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang

TL;DR
This paper explores methods to enhance deep learning models for NLP by making them more knowledge efficient, reducing reliance on large annotated datasets through classical and modern techniques.
Contribution
It adapts classical knowledge-efficient approaches to modern deep NLP models, introducing four novel methods for low-resource settings.
Findings
KRDL effectively denoises weak supervision.
KRDL assists in evidence sentence retrieval.
Improved multilingual BERT with bilingual dictionaries.
Abstract
Deep learning has become the workhorse for a wide range of natural language processing applications. But much of the success of deep learning relies on annotated examples. Annotation is time-consuming and expensive to produce at scale. Here we are interested in methods for reducing the required quantity of annotated data -- by making the learning methods more knowledge efficient so as to make them more applicable in low annotation (low resource) settings. There are various classical approaches to making the models more knowledge efficient such as multi-task learning, transfer learning, weakly supervised and unsupervised learning etc. This thesis focuses on adapting such classical methods to modern deep learning models and algorithms. This thesis describes four works aimed at making machine learning models more knowledge efficient. First, we propose a knowledge rich deep learning model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsLinear Layer · Cosine Annealing · Byte Pair Encoding · Discriminative Fine-Tuning · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Softmax · Linear Warmup With Cosine Annealing · Dense Connections · GPT
