Multi-Layer Ensembling Techniques for Multilingual Intent Classification
Charles Costello, Ruixi Lin, Vishwas Mruthyunjaya, Bettina Bolla,, Charles Jankowski

TL;DR
This paper introduces a novel multi-layer ensembling method that combines different models and initializations to significantly improve multilingual intent classification accuracy across multiple datasets, including a new banking domain dataset.
Contribution
The paper presents a new multi-layer ensembling approach and demonstrates its effectiveness in multilingual intent classification, outperforming or matching state-of-the-art results across diverse datasets.
Findings
Ensembling yields significant performance gains.
Multi-layer ensembling improves accuracy without risk.
Diverse simple models can rival complex state-of-the-art models.
Abstract
In this paper we determine how multi-layer ensembling improves performance on multilingual intent classification. We develop a novel multi-layer ensembling approach that ensembles both different model initializations and different model architectures. We also introduce a new banking domain dataset and compare results against the standard ATIS dataset and the Chinese SMP2017 dataset to determine ensembling performance in multilingual and multi-domain contexts. We run ensemble experiments across all three datasets, and conclude that ensembling provides significant performance increases, and that multi-layer ensembling is a no-risk way to improve performance on intent classification. We also find that a diverse ensemble of simple models can reach perform comparable to much more sophisticated state-of-the-art models. Our best F 1 scores on ATIS, Banking, and SMP are 97.54%, 91.79%, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Software Engineering Research · Machine Learning and Data Classification
