More is Less? A Simulation-Based Approach to Dynamic Interactions between Biases in Multimodal Models
Mounia Drissi

TL;DR
This paper introduces a simulation-based framework to analyze how biases in multimodal models interact dynamically, revealing amplification, mitigation, and neutral bias interactions across text and image modalities.
Contribution
It presents a systemic, heuristic approach to classify and analyze bias interactions in multimodal models, offering insights into bias dynamics and stability.
Findings
Amplification occurs in 22% of cases when biases are comparable.
Mitigation is observed in 11% of cases, often with dominant text bias.
Neutral interactions, 67%, are linked to higher but stable text bias.
Abstract
Multimodal machine learning models, such as those that combine text and image modalities, are increasingly used in critical domains including public safety, security, and healthcare. However, these systems inherit biases from their single modalities. This study proposes a systemic framework for analyzing dynamic multimodal bias interactions. Using the MMBias dataset, which encompasses categories prone to bias such as religion, nationality, and sexual orientation, this study adopts a simulation-based heuristic approach to compute bias scores for text-only, image-only, and multimodal embeddings. A framework is developed to classify bias interactions as amplification (multimodal bias exceeds both unimodal biases), mitigation (multimodal bias is lower than both), and neutrality (multimodal bias lies between unimodal biases), with proportional analyzes conducted to identify the dominant mode…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Multi-Agent Systems and Negotiation
