The Ikshana Hypothesis of Human Scene Understanding

Venkata Satya Sai Ajay Daliparthi

arXiv:2101.10837·cs.CV·December 1, 2021·1 cites

The Ikshana Hypothesis of Human Scene Understanding

Venkata Satya Sai Ajay Daliparthi

PDF

Open Access 2 Repos

TL;DR

This paper introduces the Ikshana hypothesis inspired by neuroscience to improve deep neural networks for scene understanding, leading to a novel CNN architecture that outperforms existing methods on semantic segmentation benchmarks.

Contribution

The paper proposes the Ikshana hypothesis as a neuroscience-inspired perspective for designing neural networks, resulting in the IkshanaNet architecture for improved scene understanding.

Findings

01

IkshanaNet outperforms baselines on Cityscapes and CamVid benchmarks.

02

The hypothesis links neuroscience insights to CNN design.

03

Empirical validation shows improved semantic segmentation results.

Abstract

In recent years, deep neural networks (DNNs) achieved state-of-the-art performance on several computer vision tasks. However, the one typical drawback of these DNNs is the requirement of massive labeled data. Even though few-shot learning methods address this problem, they often use techniques such as meta-learning and metric-learning on top of the existing methods. In this work, we address this problem from a neuroscience perspective by proposing a hypothesis named Ikshana, which is supported by several findings in neuroscience. Our hypothesis approximates the refining process of conceptual gist in the human brain while understanding a natural scene/image. While our hypothesis holds no particular novelty in neuroscience, it provides a novel perspective for designing DNNs for vision tasks. By following the Ikshana hypothesis, we design a novel neural-inspired CNN architecture named…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning

MethodsConvolution · SGD with Momentum · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · The Ikshana Hypothesis of Human Scene Understanding Mechanism