Three approaches to facilitate DNN generalization to objects in   out-of-distribution orientations and illuminations

Akira Sakai; Taro Sunagawa; Spandan Madan; Kanata Suzuki; Takashi; Katoh; Hiromichi Kobashi; Hanspeter Pfister; Pawan Sinha; Xavier Boix,; Tomotake Sasaki

arXiv:2111.00131·cs.CV·January 27, 2022

Three approaches to facilitate DNN generalization to objects in out-of-distribution orientations and illuminations

Akira Sakai, Taro Sunagawa, Spandan Madan, Kanata Suzuki, Takashi, Katoh, Hiromichi Kobashi, Hanspeter Pfister, Pawan Sinha, Xavier Boix,, Tomotake Sasaki

PDF

Open Access 1 Repo

TL;DR

This paper explores three methods—late-stopping, batch normalization tuning, and invariance enforcement—to improve DNNs' ability to recognize objects in out-of-distribution orientations and illuminations, addressing a key challenge in AI robustness.

Contribution

It introduces and evaluates three novel approaches that significantly enhance DNNs' OoD generalization, revealing a common neural mechanism underlying these improvements.

Findings

01

Each approach improves OoD accuracy by over 20%.

02

Neurons become more category-selective and invariant to OoD conditions.

03

All methods lead to similar neural invariance mechanisms.

Abstract

The training data distribution is often biased towards objects in certain orientations and illumination conditions. While humans have a remarkable capability of recognizing objects in out-of-distribution (OoD) orientations and illuminations, Deep Neural Networks (DNNs) severely suffer in this case, even when large amounts of training examples are available. In this paper, we investigate three different approaches to improve DNNs in recognizing objects in OoD orientations and illuminations. Namely, these are (i) training much longer after convergence of the in-distribution (InD) validation accuracy, i.e., late-stopping, (ii) tuning the momentum parameter of the batch normalization layers, and (iii) enforcing invariance of the neural activity in an intermediate layer to orientation and illumination conditions. Each of these approaches substantially improves the DNN's OoD accuracy (more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fujitsuresearch/three-approaches-ood
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Domain Adaptation and Few-Shot Learning

MethodsBatch Normalization