Differentiable Reasoning over Long Stories -- Assessing Systematic   Generalisation in Neural Models

Wanshui Li; Pasquale Minervini

arXiv:2203.10620·cs.CL·March 22, 2022·1 cites

Differentiable Reasoning over Long Stories -- Assessing Systematic Generalisation in Neural Models

Wanshui Li, Pasquale Minervini

PDF

Open Access

TL;DR

This paper evaluates neural models' ability to generalize systematically over long stories using the CLUTRR benchmark, revealing that modified RNNs perform well while graph neural networks are more robust.

Contribution

It provides a comprehensive analysis of neural models' systematic generalization on long stories, comparing graph-based and sequence-based approaches.

Findings

01

Modified RNNs outperform graph neural networks in accuracy.

02

Graph neural networks exhibit greater robustness across tasks.

03

Empirical evaluation on CLUTRR benchmark demonstrates model strengths and weaknesses.

Abstract

Contemporary neural networks have achieved a series of developments and successes in many aspects; however, when exposed to data outside the training distribution, they may fail to predict correct answers. In this work, we were concerned about this generalisation issue and thus analysed a broad set of models systematically and robustly over long stories. Related experiments were conducted based on the CLUTRR, which is a diagnostic benchmark suite that can analyse generalisation of natural language understanding (NLU) systems by training over small story graphs and testing on larger ones. In order to handle the multi-relational story graph, we consider two classes of neural models: "E-GNN", the graph-based models that can process graph-structured data and consider the edge attributes simultaneously; and "L-Graph", the sequence-based models which can process linearized version of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications