Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework
Abhilash Nandy, Soumya Sharma, Shubham Maddhashiya, Kapil Sachdeva,, Pawan Goyal, Niloy Ganguly

TL;DR
This paper introduces a new benchmark dataset and a multi-task learning framework for question answering over electronic manuals, significantly improving answer accuracy and demonstrating versatility across various scenarios.
Contribution
It creates a large E-Manual corpus, develops a specialized QA framework called EMQAP, and demonstrates substantial performance improvements over baselines.
Findings
40% ROUGE-L F1 score improvement over baseline
Effective multi-task learning for section identification and answer extraction
Versatile performance across different question types and datasets
Abstract
Answering questions asked from instructional corpora such as E-manuals, recipe books, etc., has been far less studied than open-domain factoid context-based question answering. This can be primarily attributed to the absence of standard benchmark datasets. In this paper we meticulously create a large amount of data connected with E-manuals and develop suitable algorithm to exploit it. We collect E-Manual Corpus, a huge corpus of 307,957 E-manuals and pretrain RoBERTa on this large corpus. We create various benchmark QA datasets which include question answer pairs curated by experts based upon two E-manuals, real user questions from Community Question Answering Forum pertaining to E-manuals etc. We introduce EMQAP (E-Manual Question Answering Pipeline) that answers questions pertaining to electronics devices. Built upon the pretrained RoBERTa, it harbors a supervised multi-task learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · EMQAP · Softmax · Linear Warmup With Linear Decay · Weight Decay · Attention Dropout · Dropout
