The Natural Language Decathlon: Multitask Learning as Question Answering
Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard, Socher

TL;DR
The paper introduces decaNLP, a multitask question answering framework covering ten diverse NLP tasks, and proposes MQAN, a unified model that learns all tasks jointly without task-specific modules, achieving strong transfer and zero-shot learning capabilities.
Contribution
It presents a new multitask question answering paradigm and a unified neural network model that handles multiple NLP tasks simultaneously without task-specific components.
Findings
MQAN improves transfer learning for translation and NER.
MQAN enables domain adaptation for sentiment analysis and NLI.
MQAN achieves zero-shot classification and state-of-the-art results on semantic parsing.
Abstract
Deep learning has improved performance on many natural language processing (NLP) tasks individually. However, general NLP models cannot emerge within a paradigm that focuses on the particularities of a single metric, dataset, and task. We introduce the Natural Language Decathlon (decaNLP), a challenge that spans ten tasks: question answering, machine translation, summarization, natural language inference, sentiment analysis, semantic role labeling, zero-shot relation extraction, goal-oriented dialogue, semantic parsing, and commonsense pronoun resolution. We cast all tasks as question answering over a context. Furthermore, we present a new Multitask Question Answering Network (MQAN) jointly learns all tasks in decaNLP without any task-specific modules or parameters in the multitask setting. MQAN shows improvements in transfer learning for machine translation and named entity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
