Dimensional Coactivation for Representational Consistency in Frozen Vision Foundation Models
Izaldein Al-Zyoud Abdulmotaleb El Saddik

TL;DR
This paper introduces Dimensional Coactivation (DCA), a novel method for measuring intra-sample representational coherence in frozen vision models, validated through deepfake detection tasks.
Contribution
The paper presents DCA, a new per-dimension coherence measure that reveals the internal organization of frozen vision models and their robustness in deepfake detection.
Findings
DCA exposes coherence breaks in deepfake faces with high AUC scores.
Ablation studies show that standard normalization reduces coherence detection performance.
Replacing the model with FaRL decreases deepfake detection effectiveness.
Abstract
Frozen vision foundation models do not merely extract features; they organize images through a learned coordinate system. We ask whether that coordinate system remains internally coherent within a single input. This leads to Representational Consistency: the study of whether a frozen foundation model represents one sample coherently across its semantic subregions. We introduce Dimensional Coactivation (DCA), a per-dimension instrument for measuring this coherence. DCA compares semantic regions by asking whether the same feature dimensions coactivate across them. Unlike classical similarity measures, it deliberately avoids centering, L2 normalization, and full Gram coupling. These operations are useful when comparing different models or distributions, but they are mismatched to the intra-sample setting, where the coordinate system is fixed and raw magnitude carries signal. Deepfake…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
