# Label-Dependencies Aware Recurrent Neural Networks

**Authors:** Yoann Dupont, Marco Dinarelli, Isabelle Tellier

arXiv: 1706.01740 · 2017-06-07

## TL;DR

This paper introduces a simple yet effective RNN variant that re-injects label embeddings into the network, improving sequence labeling in NLP tasks and outperforming more complex models like LSTM, GRU, and CRFs.

## Contribution

It proposes a novel RNN architecture with label embeddings that enhances sequence labeling performance with fewer parameters than LSTM and GRU.

## Key findings

- Outperforms Elman and Jordan RNNs in sequence labeling tasks.
- Achieves better results than complex models like LSTM, GRU, and CRFs.
- Uses fewer parameters than LSTM and GRU while maintaining high effectiveness.

## Abstract

In the last few years, Recurrent Neural Networks (RNNs) have proved effective on several NLP tasks. Despite such great success, their ability to model \emph{sequence labeling} is still limited. This lead research toward solutions where RNNs are combined with models which already proved effective in this domain, such as CRFs. In this work we propose a solution far simpler but very effective: an evolution of the simple Jordan RNN, where labels are re-injected as input into the network, and converted into embeddings, in the same way as words. We compare this RNN variant to all the other RNN models, Elman and Jordan RNN, LSTM and GRU, on two well-known tasks of Spoken Language Understanding (SLU). Thanks to label embeddings and their combination at the hidden layer, the proposed variant, which uses more parameters than Elman and Jordan RNNs, but far fewer than LSTM and GRU, is more effective than other RNNs, but also outperforms sophisticated CRF models.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.01740/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1706.01740/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/1706.01740/full.md

---
Source: https://tomesphere.com/paper/1706.01740