Is "My Favorite New Movie" My Favorite Movie? Probing the Understanding of Recursive Noun Phrases
Qing Lyu, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris, Callison-Burch

TL;DR
This paper introduces the Recursive Noun Phrase Challenge dataset to evaluate whether language models understand recursive noun phrases, revealing current models' limitations and potential for learning this knowledge.
Contribution
The paper presents RNPC, a novel dataset for testing recursive NP understanding, and demonstrates that models can learn this knowledge with proper training.
Findings
State-of-the-art models perform at chance on RNPC
Models can learn recursive NP understanding with appropriate data
Training on RNPC improves zero-shot performance on harm detection
Abstract
Recursive noun phrases (NPs) have interesting semantic properties. For example, "my favorite new movie" is not necessarily my favorite movie, whereas "my new favorite movie" is. This is common sense to humans, yet it is unknown whether language models have such knowledge. We introduce the Recursive Noun Phrase Challenge (RNPC), a dataset of three textual inference tasks involving textual entailment and event plausibility comparison, precisely targeting the understanding of recursive NPs. When evaluated on RNPC, state-of-the-art Transformer models only perform around chance. Still, we show that such knowledge is learnable with appropriate data. We further probe the models for relevant linguistic features that can be learned from our tasks, including modifier semantic category and modifier scope. Finally, models trained on RNPC achieve strong zero-shot performance on an extrinsic Harm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dense Connections · Residual Connection · Layer Normalization · Dropout · Label Smoothing · Byte Pair Encoding · Softmax
