Multi-Dimensional Recurrent Neural Networks

Alex Graves; Santiago Fernandez; Juergen Schmidhuber

arXiv:0705.2011 · cs.AI · May 23, 2007 · 48 cites

TL;DR

This paper introduces multi-dimensional recurrent neural networks (MDRNNs), which extend RNNs to data with more than one spatio-temporal dimension. This opens up applications in vision, video processing and medical imaging while avoiding the scaling problems of earlier multi-dimensional models.

Contribution

The paper presents MDRNNs, a direct way of applying RNNs to multi-dimensional data that avoids the scaling problems that have affected other multi-dimensional models.

Findings

01 Experimental results are reported for two image segmentation tasks

02 Avoids the scaling problems that have affected other multi-dimensional models

03 Potential applications in vision, video processing and medical imaging

Abstract

Recurrent neural networks (RNNs) have proved effective at one-dimensional sequence learning tasks, such as speech and online handwriting recognition. Some of the properties that make RNNs suitable for such tasks, for example robustness to input warping and the ability to access contextual information, are also desirable in multi-dimensional domains. However, there has so far been no direct way of applying RNNs to data with more than one spatio-temporal dimension. This paper introduces multi-dimensional recurrent neural networks (MDRNNs), thereby extending the potential applicability of RNNs to vision, video processing, medical imaging and many other areas, while avoiding the scaling problems that have plagued other multi-dimensional models. Experimental results are provided for two image segmentation tasks.
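
To make the core idea concrete, the following is a minimal sketch of a two-dimensional recurrent layer, in which the hidden state at each pixel receives one recurrent connection per dimension (from the neighbour above and the neighbour to the left). It is an illustration under stated assumptions, not the authors' implementation: the function name mdrnn_forward_2d, the plain tanh units, the single top-left to bottom-right scan order and the zero boundary states are all choices made here for brevity.

```python
import math


def mdrnn_forward_2d(image, w_in, w_up, w_left, bias):
    """Forward pass of a toy 2D recurrent layer (hypothetical helper).

    image  : H x W grid of input vectors (lists of floats)
    w_in   : n_hidden x n_input input weights
    w_up   : n_hidden x n_hidden recurrent weights along the vertical dimension
    w_left : n_hidden x n_hidden recurrent weights along the horizontal dimension
    bias   : n_hidden bias values
    Returns an H x W grid of hidden-state vectors.
    """
    height, width = len(image), len(image[0])
    n_hidden = len(bias)
    zero = [0.0] * n_hidden  # assumed boundary state outside the grid
    hidden = [[zero for _ in range(width)] for _ in range(height)]

    # Scan so that both predecessors of (i, j) are computed before (i, j).
    for i in range(height):
        for j in range(width):
            h_up = hidden[i - 1][j] if i > 0 else zero
            h_left = hidden[i][j - 1] if j > 0 else zero
            x = image[i][j]
            state = []
            for k in range(n_hidden):
                a = bias[k]
                a += sum(w * v for w, v in zip(w_in[k], x))
                a += sum(w * v for w, v in zip(w_up[k], h_up))
                a += sum(w * v for w, v in zip(w_left[k], h_left))
                state.append(math.tanh(a))
            hidden[i][j] = state
    return hidden


# Usage: a 2 x 2 single-channel image with one hidden unit.
image = [[[1.0], [0.0]],
         [[0.0], [0.0]]]
states = mdrnn_forward_2d(image, w_in=[[1.0]], w_up=[[0.5]],
                          w_left=[[0.5]], bias=[0.0])
```

In this sketch the state at the bottom-right pixel depends on the top-left input only through the recurrent connections, which is how context propagates across both dimensions. The cost of one pass grows linearly with the number of pixels.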

Taxonomy

Topics: Neural Networks and Applications · Time Series Analysis and Forecasting · Speech Recognition and Synthesis