Directional Confusions Reveal Divergent Inductive Biases Through Rate-Distortion Geometry in Human and Machine Vision
Leyla Roksan Caglar, Pedro A.M. Mediano, Baihan Lin

TL;DR
This study compares human and machine vision, revealing that directional confusions expose different underlying biases and generalization strategies, which are not apparent through accuracy alone.
Contribution
It introduces the analysis of directional confusion asymmetries as a novel method to uncover and interpret divergent inductive biases in human and machine vision systems.
Findings
Humans show broad, weak asymmetries across many class pairs.
Deep models exhibit sparse, strong directional confusions into few categories.
Robustness training reduces asymmetry but does not replicate human-like structure.
Abstract
To humans, a robin seems more like a bird than a bird seems like a robin, but does this asymmetry also hold for machine vision? Humans and modern vision models can match each other in accuracy while making systematically different kinds of errors, differing not in how often they fail, but in who gets mistaken for whom. We show these directional confusions reveal distinct inductive biases invisible to accuracy alone. Using matched human and deep neural network responses on a natural-image categorization task under 12 perturbation types, we quantify asymmetry in confusion matrices and link its organization to the geometry of the information--error trade-off - how efficiently, and how gracefully, a system generalizes under distortion. We find that humans exhibit broad but weak asymmetries across many class pairs, whereas deep vision models show sparser, stronger directional collapses into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
