Large-Scale QA-SRL Parsing

Nicholas FitzGerald; Julian Michael; Luheng He; Luke Zettlemoyer

arXiv:1805.05377·cs.CL·May 16, 2018

Large-Scale QA-SRL Parsing

Nicholas FitzGerald, Julian Michael, Luheng He, Luke Zettlemoyer

PDF

3 Repos 2 Datasets

TL;DR

This paper introduces a large-scale QA-SRL corpus and a high-quality neural parser, enabling more accurate semantic role labeling through question-answer pairs across multiple domains.

Contribution

It provides the first high-quality QA-SRL parser and a large annotated corpus, advancing semantic role labeling with new neural models and crowd-sourcing methods.

Findings

01

QA-SRL Bank 2.0 contains over 250,000 question-answer pairs for 64,000 sentences.

02

Best models achieve 82.6% question accuracy and 77.6% span accuracy.

03

Models can be used to efficiently gather additional annotations.

Abstract

We present a new large-scale corpus of Question-Answer driven Semantic Role Labeling (QA-SRL) annotations, and the first high-quality QA-SRL parser. Our corpus, QA-SRL Bank 2.0, consists of over 250,000 question-answer pairs for over 64,000 sentences across 3 domains and was gathered with a new crowd-sourcing scheme that we show has high precision and good recall at modest cost. We also present neural models for two QA-SRL subtasks: detecting argument spans for a predicate and generating questions to label the semantic relationship. The best models achieve question accuracy of 82.6% and span-level accuracy of 77.6% (under human evaluation) on the full pipelined QA-SRL prediction task. They can also, as we show, be used to gather additional annotations at low cost.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.