DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance

Seffi Cohen; Niv Goldshlager; Nurit Cohen-Inger; Bracha Shapira; Lior; Rokach

arXiv:2501.17479·cs.LG·February 10, 2025

DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance

Seffi Cohen, Niv Goldshlager, Nurit Cohen-Inger, Bracha Shapira, Lior, Rokach

PDF

Open Access 1 Repo 1 Video

TL;DR

The paper introduces DFPE, an ensemble method that combines multiple LLMs using clustering, filtering, and weighting to improve performance and robustness across diverse language tasks.

Contribution

It presents a novel ensemble approach that leverages model diversity and adaptive weighting to enhance LLM performance on complex benchmarks.

Findings

01

DFPE outperforms single models by 3% overall accuracy.

02

DFPE achieves 5% higher accuracy at the discipline level.

03

The method improves robustness and generalization of LLMs.

Abstract

Large Language Models (LLMs) have shown remarkable capabilities across various natural language processing tasks but often struggle to excel uniformly in diverse or complex domains. We propose a novel ensemble method - Diverse Fingerprint Ensemble (DFPE), which leverages the complementary strengths of multiple LLMs to achieve more robust performance. Our approach involves: (1) clustering models based on response "fingerprints" patterns, (2) applying a quantile-based filtering mechanism to remove underperforming models at a per-subject level, and (3) assigning adaptive weights to remaining models based on their subject-wise validation accuracy. In experiments on the Massive Multitask Language Understanding (MMLU) benchmark, DFPE outperforms the best single model by 3% overall accuracy and 5% in discipline-level accuracy. This method increases the robustness and generalization of LLMs and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nivgold/dfpe
pytorchOfficial

Videos

DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance· underline

Taxonomy

TopicsDigital Rights Management and Security