TIES-Merging: Resolving Interference When Merging Models
Prateek Yadav, Derek Tam, Leshem Choshen, Colin Raffel, Mohit Bansal

TL;DR
TIES-Merging is a novel model merging technique that effectively resolves parameter interference, including sign conflicts and redundancy, to combine multiple fine-tuned models into a robust multitask model across various settings.
Contribution
The paper introduces TIES-Merging, a new method that addresses interference issues in model merging by resolving sign conflicts and parameter redundancy, improving performance over existing techniques.
Findings
TIES-Merging outperforms existing methods across diverse tasks and models.
Resolving sign interference significantly improves merging outcomes.
Parameter redundancy impacts model merging effectiveness.
Abstract
Transfer learning - i.e., further fine-tuning a pre-trained model on a downstream task - can confer significant advantages, including improved downstream performance, faster convergence, and better sample efficiency. These advantages have led to a proliferation of task-specific fine-tuned models, which typically can only perform a single task and do not benefit from one another. Recently, model merging techniques have emerged as a solution to combine multiple task-specific models into a single multitask model without performing additional training. However, existing merging methods often ignore the interference between parameters of different models, resulting in large performance drops when merging multiple models. In this paper, we demonstrate that prior merging techniques inadvertently lose valuable information due to two major sources of interference: (a) interference due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗inflatebot/MN-12B-Mag-Mell-R1model· 853 dl· ♡ 235853 dl♡ 235
- 🤗ClaudioItaly/TypressAI-9Bmodel· 10 dl· ♡ 310 dl♡ 3
- 🤗trashpanda-org/QwQ-32B-Snowdrop-v0model· 32 dl· ♡ 10232 dl♡ 102
- 🤗nbeerbower/Huihui-Qwen3.5-9B-abliterated-TIESmodel· 64 dl· ♡ 164 dl♡ 1
- 🤗Mantis2024/Dirty-Muse-Writer-v01-Uncensored-Erotica-NSFWmodel· 423 dl· ♡ 46423 dl♡ 46
- 🤗chargoddard/Chronorctypus-Limarobormes-13bmodel· 710 dl· ♡ 12710 dl♡ 12
- 🤗TheBloke/Chronorctypus-Limarobormes-13b-GGMLmodel· 5 dl· ♡ 45 dl♡ 4
- 🤗TheBloke/Chronorctypus-Limarobormes-13b-GPTQmodel· 7 dl· ♡ 67 dl♡ 6
- 🤗Envoid/Yousei-22Bmodel· 754 dl· ♡ 2754 dl♡ 2
- 🤗TheBloke/Chronorctypus-Limarobormes-13b-GGUFmodel· 61 dl· ♡ 261 dl♡ 2
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications
