Effectiveness of Deep Networks in NLP using BiDAF as an example architecture
Soumyendu Sarkar

TL;DR
This paper investigates the effectiveness of deep neural networks in NLP question answering, focusing on the BiDAF architecture, and explores how layered refinements and embeddings contribute to performance improvements.
Contribution
It evaluates various deep network architectures and embedding strategies within BiDAF, demonstrating their additive benefits for NLP question answering tasks.
Findings
Deep networks provide performance boosts in NLP models.
Refinements in lower layers additively improve results.
Ensembling layers enhances model accuracy.
Abstract
Question Answering with NLP has progressed through the evolution of advanced model architectures like BERT and BiDAF and earlier word, character, and context-based embeddings. As BERT has leapfrogged the accuracy of models, an element of the next frontier can be the introduction of deep networks and an effective way to train them. In this context, I explored the effectiveness of deep networks focussing on the model encoder layer of BiDAF. BiDAF with its heterogeneous layers provides the opportunity not only to explore the effectiveness of deep networks but also to evaluate whether the refinements made in lower layers are additive to the refinements made in the upper layers of the model architecture. I believe the next greatest model in NLP will in fact fold in a solid language modeling like BERT with a composite architecture which will bring in refinements in addition to generic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Concatenated Skip Connection · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Max Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Multi-Head Attention
