Material Recognition in the Wild with the Materials in Context Database
Sean Bell, Paul Upchurch, Noah Snavely, Kavita Bala

TL;DR
This paper introduces MINC, a large-scale, diverse dataset for real-world material recognition, and demonstrates how deep learning models trained on MINC can effectively classify and segment materials in complex images.
Contribution
The paper presents MINC, the largest and most diverse material dataset to date, and develops CNN-based methods for material classification and segmentation in natural images.
Findings
CNNs achieve 85.2% accuracy on patch classification
Fully convolutional CNN + CRF achieves 73.1% pixel-wise accuracy
Large dataset like MINC is essential for real-world material recognition
Abstract
Recognizing materials in real-world images is a challenging task. Real-world materials have rich surface texture, geometry, lighting conditions, and clutter, which combine to make the problem particularly difficult. In this paper, we introduce a new, large-scale, open dataset of materials in the wild, the Materials in Context Database (MINC), and combine this dataset with deep learning to achieve material recognition and segmentation of images in the wild. MINC is an order of magnitude larger than previous material databases, while being more diverse and well-sampled across its 23 categories. Using MINC, we train convolutional neural networks (CNNs) for two tasks: classifying materials from patches, and simultaneous material recognition and segmentation in full images. For patch-based classification on MINC we found that the best performing CNN architectures can achieve 85.2% mean…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques
