Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to   Address Shortcut Shifts in Natural Language Understanding

Ukyo Honda; Tatsushi Oka; Peinan Zhang; Masato Mita

arXiv:2406.12060·cs.CL·November 19, 2024

Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding

Ukyo Honda, Tatsushi Oka, Peinan Zhang, Masato Mita

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper proposes a post-hoc aggregation method for mixture-of-experts models to improve robustness against shortcut reliance and distribution shifts in natural language understanding tasks.

Contribution

It introduces a novel post-hoc control technique for mixture-of-experts to mitigate shortcut dependence without retraining the entire model.

Findings

01

Enhanced robustness to shortcut-induced distribution shifts

02

Post-hoc aggregation improves model reliability

03

Practical advantages over previous elimination methods

Abstract

Recent models for natural language understanding are inclined to exploit simple patterns in datasets, commonly known as shortcuts. These shortcuts hinge on spurious correlations between labels and latent features existing in the training data. At inference time, shortcut-dependent models are likely to generate erroneous predictions under distribution shifts, particularly when some latent features are no longer correlated with the labels. To avoid this, previous studies have trained models to eliminate the reliance on shortcuts. In this study, we explore a different direction: pessimistically aggregating the predictions of a mixture-of-experts, assuming each expert captures relatively different latent features. The experimental results demonstrate that our post-hoc control over the experts significantly enhances the model's robustness to the distribution shift in shortcuts. Besides, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CyberAgentAILab/posthoc-control-moe
pytorchOfficial

Videos

Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems