An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings

Soham V. Govande

arXiv:2410.01704·cs.CL·June 6, 2025

An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings

Soham V. Govande

PDF

Open Access 1 Repo

TL;DR

This paper introduces SAMI, a self-supervised mutual information alignment method for multi-task language model tuning, demonstrating competitive performance against DPO and insights into its scaling behavior for mathematical tasks.

Contribution

The paper presents SAMI, a novel self-supervised alignment approach using mutual information, and evaluates its effectiveness across multi-task benchmarks and mathematical accuracy scenarios.

Findings

01

SAMI achieves a 57% win rate against DPO on MT-Bench.

02

SAMI increases zero-shot math accuracy by 1.1%.

03

Scaling SAMI with multiple attempts improves performance.

Abstract

There is a growing need for pluralistic alignment methods that can steer language models towards individual attributes and preferences. One such method, Self-Supervised Alignment with Mutual Information (SAMI), uses conditional mutual information to encourage the connection between behavioral preferences and model responses. We conduct two experiments exploring SAMI in multi-task settings. First, we compare SAMI to Direct Preference Optimization (DPO) on a multi-task benchmark (MT-Bench), using a stronger model to generate training data for a weaker one across diverse categories (humanities, STEM, extraction, coding, math, reasoning, and roleplay). Our results indicate that one iteration of SAMI has a 57% win rate against DPO, with significant variation in performance between task categories. Second, we examine SAMI's impact on mathematical accuracy (GSM-8K) relative to supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SohamGovande/sami-extension
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Speech and dialogue systems

MethodsDirect Preference Optimization · Shrink and Fine-Tune