# Survey of reasoning using Neural networks

**Authors:** Amit Sahu

arXiv: 1702.06186 · 2017-03-03

## TL;DR

This survey reviews neural network architectures designed for reasoning and memory tasks, discussing their mechanisms, challenges, and performance on simple algorithms and question-answering applications.

## Contribution

It provides a comprehensive overview of neural architectures for reasoning, including memory networks and neural Turing machines, highlighting techniques and experimental results.

## Key findings

- Memory-augmented neural networks perform comparably to state-of-the-art on simple tasks.
- Attention mechanisms and gradient methods are crucial for training discrete memory.
- Preliminary results show promise for reasoning tasks with these architectures.

## Abstract

Reason and inference require process as well as memory skills by humans. Neural networks are able to process tasks like image recognition (better than humans) but in memory aspects are still limited (by attention mechanism, size). Recurrent Neural Network (RNN) and it's modified version LSTM are able to solve small memory contexts, but as context becomes larger than a threshold, it is difficult to use them. The Solution is to use large external memory. Still, it poses many challenges like, how to train neural networks for discrete memory representation, how to describe long term dependencies in sequential data etc. Most prominent neural architectures for such tasks are Memory networks: inference components combined with long term memory and Neural Turing Machines: neural networks using external memory resources. Also, additional techniques like attention mechanism, end to end gradient descent on discrete memory representation are needed to support these solutions. Preliminary results of above neural architectures on simple algorithms (sorting, copying) and Question Answering (based on story, dialogs) application are comparable with the state of the art. In this paper, I explain these architectures (in general), the additional techniques used and the results of their application.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1702.06186/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1702.06186/full.md

## References

15 references — full list in the complete paper: https://tomesphere.com/paper/1702.06186/full.md

---
Source: https://tomesphere.com/paper/1702.06186