Large-Scale QA-SRL Parsing
Nicholas FitzGerald, Julian Michael, Luheng He, Luke Zettlemoyer

TL;DR
This paper introduces a large-scale QA-SRL corpus and a high-quality neural parser, enabling more accurate semantic role labeling through question-answer pairs across multiple domains.
Contribution
It provides the first high-quality QA-SRL parser and a large annotated corpus, advancing semantic role labeling with new neural models and crowd-sourcing methods.
Findings
QA-SRL Bank 2.0 contains over 250,000 question-answer pairs for 64,000 sentences.
Best models achieve 82.6% question accuracy and 77.6% span accuracy.
Models can be used to efficiently gather additional annotations.
Abstract
We present a new large-scale corpus of Question-Answer driven Semantic Role Labeling (QA-SRL) annotations, and the first high-quality QA-SRL parser. Our corpus, QA-SRL Bank 2.0, consists of over 250,000 question-answer pairs for over 64,000 sentences across 3 domains and was gathered with a new crowd-sourcing scheme that we show has high precision and good recall at modest cost. We also present neural models for two QA-SRL subtasks: detecting argument spans for a predicate and generating questions to label the semantic relationship. The best models achieve question accuracy of 82.6% and span-level accuracy of 77.6% (under human evaluation) on the full pipelined QA-SRL prediction task. They can also, as we show, be used to gather additional annotations at low cost.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
