Neural Twins Talk & Alternative Calculations

Zanyar Zohourianshahzadi; Jugal K. Kalita

arXiv:2108.02807·cs.CV·September 1, 2021

Neural Twins Talk & Alternative Calculations

Zanyar Zohourianshahzadi, Jugal K. Kalita

PDF

1 Repo

TL;DR

This paper explores extending deep attentive models for image captioning to improve performance, inspired by human neural pathways, with potential applications in biology and medicine.

Contribution

It introduces an approach that enhances image captioning models by mimicking human neural pathway complexity, aiming for better descriptive accuracy.

Findings

01

Improved captioning accuracy demonstrated on benchmark datasets.

02

Enhanced model performance with increased neural pathway simulation.

03

Potential applications in medical and biological image analysis.

Abstract

Inspired by how the human brain employs a higher number of neural pathways when describing a highly focused subject, we show that deep attentive models used for the main vision-language task of image captioning, could be extended to achieve better performance. Image captioning bridges a gap between computer vision and natural language processing. Automated image captioning is used as a tool to eliminate the need for human agent for creating descriptive captions for unseen images.Automated image captioning is challenging and yet interesting. One reason is that AI based systems capable of generating sentences that describe an input image could be used in a wide variety of tasks beyond generating captions for unseen images found on web or uploaded to social media. For example, in biology and medical sciences, these systems could provide researchers and physicians with a brief linguistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zanyarz/NeuralTwinsTalk
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.