CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi, Aliakbar Panahi, Seyran Saeedi, Alvis C Fong

TL;DR
This paper evaluates various deep learning architectures on a commonsense reasoning task, achieving high accuracy and demonstrating the effectiveness of reformulating classification as multiple choice questions.
Contribution
It introduces a novel approach of converting classification tasks into multiple choice questions to improve performance on commonsense reasoning benchmarks.
Findings
Achieved 96.06% accuracy on the first subtask.
Ranked within top six teams for identifying reasons for nonsensical statements.
Used GPT-2 for generating explanations with promising results.
Abstract
In this paper, we investigate a commonsense inference task that unifies natural language understanding and commonsense reasoning. We describe our attempt at SemEval-2020 Task 4 competition: Commonsense Validation and Explanation (ComVE) challenge. We discuss several state-of-the-art deep learning architectures for this challenge. Our system uses prepared labeled textual datasets that were manually curated for three different natural language inference subtasks. The goal of the first subtask is to test whether a model can distinguish between natural language statements that make sense and those that do not make sense. We compare the performance of several language models and fine-tuned classifiers. Then, we propose a method inspired by question/answering tasks to treat a classification problem as a multiple choice question task to boost the performance of our experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
