Making Neural Machine Reading Comprehension Faster

Debajyoti Chatterjee

arXiv:1904.00796·cs.CL·April 2, 2019·6 cites

Making Neural Machine Reading Comprehension Faster

Debajyoti Chatterjee

PDF

Open Access

TL;DR

This paper presents a method to accelerate neural machine reading comprehension by applying knowledge distillation to create smaller, faster models based on BERT, aiming to improve inference speed without sacrificing accuracy.

Contribution

The study introduces a knowledge distillation approach to develop faster, smaller BERT-based models for machine reading comprehension, enhancing inference efficiency.

Findings

01

Smaller models achieved faster inference times.

02

Distilled models maintained comparable accuracy to larger models.

03

Compared favorably with existing speed-optimized models.

Abstract

This study aims at solving the Machine Reading Comprehension problem where questions have to be answered given a context passage. The challenge is to develop a computationally faster model which will have improved inference time. State of the art in many natural language understanding tasks, BERT model, has been used and knowledge distillation method has been applied to train two smaller models. The developed models are compared with other models which have been developed with the same intention.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsLinear Layer · Knowledge Distillation · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece