The Ikshana Hypothesis of Human Scene Understanding
Venkata Satya Sai Ajay Daliparthi

TL;DR
This paper introduces the Ikshana hypothesis inspired by neuroscience to improve deep neural networks for scene understanding, leading to a novel CNN architecture that outperforms existing methods on semantic segmentation benchmarks.
Contribution
The paper proposes the Ikshana hypothesis as a neuroscience-inspired perspective for designing neural networks, resulting in the IkshanaNet architecture for improved scene understanding.
Findings
IkshanaNet outperforms baselines on Cityscapes and CamVid benchmarks.
The hypothesis links neuroscience insights to CNN design.
Empirical validation shows improved semantic segmentation results.
Abstract
In recent years, deep neural networks (DNNs) achieved state-of-the-art performance on several computer vision tasks. However, the one typical drawback of these DNNs is the requirement of massive labeled data. Even though few-shot learning methods address this problem, they often use techniques such as meta-learning and metric-learning on top of the existing methods. In this work, we address this problem from a neuroscience perspective by proposing a hypothesis named Ikshana, which is supported by several findings in neuroscience. Our hypothesis approximates the refining process of conceptual gist in the human brain while understanding a natural scene/image. While our hypothesis holds no particular novelty in neuroscience, it provides a novel perspective for designing DNNs for vision tasks. By following the Ikshana hypothesis, we design a novel neural-inspired CNN architecture named…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
MethodsConvolution · SGD with Momentum · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · The Ikshana Hypothesis of Human Scene Understanding Mechanism
