Combining Independent Modules in Lexical Multiple-Choice Problems
Peter D. Turney, Michael L. Littman, Jeffrey Bigham, Victor Shnayder

TL;DR
This paper explores ensemble methods for combining multiple modules in lexical multiple-choice tasks, demonstrating that ensemble approaches improve accuracy over individual modules, with a focus on three merging rules including a novel product rule.
Contribution
It introduces and evaluates a new product merging rule for combining probability distributions in lexical tasks, comparing it to existing methods.
Findings
All three merging rules outperform individual modules.
The mixture rule is not statistically the best among the three.
Ensemble methods enhance accuracy in lexical semantics problems.
Abstract
Existing statistical approaches to natural language problems are very coarse approximations to the true complexity of language processing. As such, no single technique will be best for all problem instances. Many researchers are examining ensemble methods that combine the output of multiple modules to create more accurate solutions. This paper examines three merging rules for combining probability distributions: the familiar mixture rule, the logarithmic rule, and a novel product rule. These rules were applied with state-of-the-art results to two problems used to assess human mastery of lexical semantics -- synonym questions and analogy questions. All three merging rules result in ensembles that are more accurate than any of their component modules. The differences among the three rules are not statistically significant, but it is suggestive that the popular mixture rule is not the best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques
