QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for   Zero-Shot Commonsense Question Answering

Haochen Shi; Weiqi Wang; Tianqing Fang; Baixuan Xu; Wenxuan Ding; Xin; Liu; Yangqiu Song

arXiv:2310.11303·cs.CL·October 18, 2023·1 cites

QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering

Haochen Shi, Weiqi Wang, Tianqing Fang, Baixuan Xu, Wenxuan Ding, Xin, Liu, Yangqiu Song

PDF

Open Access 1 Repo

TL;DR

QADYNAMICS is a training dynamics-driven framework that refines synthetic QA data by analyzing training behavior, removing noise and false negatives, thereby improving zero-shot commonsense QA performance with less data.

Contribution

It introduces a novel training dynamics-based method for QA diagnostics and refinement, reducing noise in synthetic data and enhancing model generalization in zero-shot commonsense QA.

Findings

01

Outperforms baselines using only 33% of synthetic data

02

Significantly improves QA quality according to expert evaluations

03

Effective even with large language models like ChatGPT

Abstract

Zero-shot commonsense Question-Answering (QA) requires models to reason about general situations beyond specific benchmarks. State-of-the-art approaches fine-tune language models on QA pairs constructed from CommonSense Knowledge Bases (CSKBs) to equip the models with more commonsense knowledge in a QA context. However, current QA synthesis protocols may introduce noise from the CSKBs and generate ungrammatical questions and false negative options, which impede the model's ability to generalize. To address these issues, we propose QADYNAMICS, a training dynamics-driven framework for QA diagnostics and refinement. Our approach analyzes the training dynamics of each QA pair at both the question level and option level, discarding machine-detectable artifacts by removing uninformative QA pairs and mislabeled or false-negative options. Extensive experiments demonstrate the effectiveness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hkust-knowcomp/qadynamics
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies