Contrast Sensitivity Functions in Autoencoders
Qiang Li, Alex Gomez-Villa, Marcelo Bertalmio, Jesus Malo

TL;DR
This paper investigates how convolutional autoencoders trained on low-level vision tasks can develop human-like contrast sensitivity functions, highlighting the limitations and implications of using CNNs for modeling human visual perception.
Contribution
It demonstrates that CNN autoencoders can replicate human CSFs for certain tasks and shows that deeper networks may perform worse in modeling low-level visual phenomena.
Findings
Autoencoders can develop human-like CSFs in specific low-level tasks.
Deeper CNNs may be less effective in replicating human low-level vision phenomena.
CNN architectures' simplicity can limit their usefulness in understanding human vision.
Abstract
Three decades ago, Atick et al. suggested that human frequency sensitivity may emerge from the enhancement required for a more efficient analysis of retinal images. Here we reassess the relevance of low-level vision tasks in the explanation of the Contrast Sensitivity Functions (CSFs) in light of (1) the current trend of using artificial neural networks for studying vision, and (2) the current knowledge of retinal image representations. As a first contribution, we show that a very popular type of convolutional neural networks (CNNs), called autoencoders, may develop human-like CSFs in the spatio-temporal and chromatic dimensions when trained to perform some basic low-level vision tasks (like retinal noise and optical blur removal), but not others (like chromatic adaptation or pure reconstruction after simple bottlenecks). As an illustrative example, the best CNN (in the considered set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
