TL;DR
LISA is a neural network model that integrates explicit syntactic information into self-attention for improved semantic role labeling, achieving state-of-the-art results without extensive pre-processing.
Contribution
The paper introduces LISA, a novel self-attention based model that incorporates syntax directly from raw tokens and improves SRL performance.
Findings
Achieves new state-of-the-art F1 scores on CoNLL-2005 SRL.
Improves SRL accuracy with high-quality syntactic parses at test time.
Outperforms previous models even with contextualized word embeddings.
Abstract
Current state-of-the-art semantic role labeling (SRL) uses a deep neural network with no explicit linguistic features. However, prior work has shown that gold syntax trees can dramatically improve SRL decoding, suggesting the possibility of increased accuracy from explicit modeling of syntax. In this work, we present linguistically-informed self-attention (LISA): a neural network model that combines multi-head self-attention with multi-task learning across dependency parsing, part-of-speech tagging, predicate detection and SRL. Unlike previous models which require significant pre-processing to prepare linguistic features, LISA can incorporate syntax using merely raw tokens as input, encoding the sequence only once to simultaneously perform parsing, predicate detection and role labeling for all predicates. Syntax is incorporated by training one attention head to attend to syntactic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
