CoCoA-Mix: Confusion-and-Confidence-Aware Mixture Model for Context Optimization
Dasol Hong, Wooju Lee, Hyun Myung

TL;DR
CoCoA-Mix introduces a confusion-and-confidence-aware mixture model that enhances task-specific adaptation and generalization in prompt tuning for vision-language models by refining decision boundaries and adjusting prediction weights.
Contribution
It proposes a novel confusion-aware loss and confidence-aware weights within a mixture model to improve both specialization and generalization in prompt tuning.
Findings
Outperforms state-of-the-art methods in experiments.
Enhances task-specific adaptation and unseen domain generalization.
Effectively refines decision boundaries between confusing classes.
Abstract
Prompt tuning, which adapts vision-language models by freezing model parameters and optimizing only the prompt, has proven effective for task-specific adaptations. The core challenge in prompt tuning is improving specialization for a specific task and generalization for unseen domains. However, frozen encoders often produce misaligned features, leading to confusion between classes and limiting specialization. To overcome this issue, we propose a confusion-aware loss (CoA-loss) that improves specialization by refining the decision boundaries between confusing classes. Additionally, we mathematically demonstrate that a mixture model can enhance generalization without compromising specialization. This is achieved using confidence-aware weights (CoA-weights), which adjust the weights of each prediction in the mixture model based on its confidence within the class domains. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
