Ensemble Transformer for Efficient and Accurate Ranking Tasks: an   Application to Question Answering Systems

Yoshitomo Matsubara; Luca Soldaini; Eric Lind; Alessandro Moschitti

arXiv:2201.05767·cs.CL·January 3, 2025

Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems

Yoshitomo Matsubara, Luca Soldaini, Eric Lind, Alessandro Moschitti

PDF

Open Access 1 Repo

TL;DR

This paper introduces CERBERUS, an efficient neural network that distills multiple large transformer models into a single smaller model, significantly improving answer sentence selection accuracy while reducing computational costs.

Contribution

The paper proposes CERBERUS, a novel multi-head distillation approach that preserves ensemble diversity in a compact model for question answering tasks.

Findings

01

CERBERUS outperforms single-model distillations on three datasets.

02

It rivals larger models with 2.7x more parameters and 2.5x slower.

03

The approach effectively captures heterogeneous transformer knowledge.

Abstract

Large transformer models can highly improve Answer Sentence Selection (AS2) tasks, but their high computational costs prevent their use in many real-world applications. In this paper, we explore the following research question: How can we make the AS2 models more accurate without significantly increasing their model complexity? To address the question, we propose a Multiple Heads Student architecture (named CERBERUS), an efficient neural network designed to distill an ensemble of large transformers into a single smaller model. CERBERUS consists of two components: a stack of transformer layers that is used to encode inputs, and a set of ranking heads; unlike traditional distillation technique, each of them is trained by distilling a different large transformer architecture in a way that preserves the diversity of the ensemble members. The resulting model captures the knowledge of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-research/wqa-cerberus
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Expert finding and Q&A systems