Zero-Shot Estimation of Base Models' Weights in Ensemble of Machine   Reading Comprehension Systems for Robust Generalization

Razieh Baradaran; Hossein Amirkhani

arXiv:2106.16013·cs.CL·July 1, 2021

Zero-Shot Estimation of Base Models' Weights in Ensemble of Machine Reading Comprehension Systems for Robust Generalization

Razieh Baradaran, Hossein Amirkhani

PDF

Open Access

TL;DR

This paper introduces a zero-shot weighted ensemble approach to enhance out-of-domain robustness of machine reading comprehension models, effectively improving accuracy and domain adaptability without additional training.

Contribution

It proposes a novel zero-shot weight estimation method for ensemble models, addressing out-of-domain generalization in MRC systems.

Findings

01

Improved accuracy on out-of-domain datasets

02

Enhanced robustness against domain shifts

03

Effective ensemble aggregation based on estimated weights

Abstract

One of the main challenges of the machine reading comprehension (MRC) models is their fragile out-of-domain generalization, which makes these models not properly applicable to real-world general-purpose question answering problems. In this paper, we leverage a zero-shot weighted ensemble method for improving the robustness of out-of-domain generalization in MRC models. In the proposed method, a weight estimation module is used to estimate out-of-domain weights, and an ensemble module aggregate several base models' predictions based on their weights. The experiments indicate that the proposed method not only improves the final accuracy, but also is robust against domain changes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques