Distinguishing mirror from glass: A 'big data' approach to material   perception

Hideki Tamura; Konrad E. Prokott; Roland W. Fleming

arXiv:1903.01671·cs.CV·March 14, 2022

Distinguishing mirror from glass: A 'big data' approach to material perception

Hideki Tamura, Konrad E. Prokott, Roland W. Fleming

PDF

TL;DR

This study uses a large dataset and neural network models to investigate how humans distinguish mirror from glass materials, revealing that shallow networks align more closely with human judgments but still fall short of human consistency.

Contribution

The paper demonstrates that shallow neural networks better predict human material perception in a challenging task, but highlights limitations in current models' ability to fully replicate human judgments.

Findings

01

Shallow networks outperform deeper ones in predicting human judgments.

02

No neural network model exceeds 0.6 correlation with human performance.

03

Models do not fully replicate the high inter-human consistency in material perception.

Abstract

Visually identifying materials is crucial for many tasks, yet material perception remains poorly understood. Distinguishing mirror from glass is particularly challenging as both materials derive their appearance from their surroundings, yet we rarely experience difficulties telling them apart. Here we took a 'big data' approach to uncovering the underlying visual cues and processes, leveraging recent advances in neural network models of vision. We trained thousands of convolutional neural networks on >750,000 simulated mirror and glass objects, and compared their performance with human judgments, as well as alternative classifiers based on 'hand-engineered' image features. For randomly chosen images, all classifiers and humans performed with high accuracy, and therefore correlated highly with one another. To tease the models apart, we then painstakingly assembled a diagnostic image set…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.