# Using Automatically Extracted Minimum Spans to Disentangle Coreference   Evaluation from Boundary Detection

**Authors:** Nafise Sadat Moosavi, Leo Born, Massimo Poesio, Michael Strube

arXiv: 1906.06703 · 2019-06-18

## TL;DR

This paper introduces MINA, an algorithm that automatically extracts minimum spans for coreference evaluation, reducing annotation costs and improving cross-dataset performance by disentangling mention boundary detection from coreference scoring.

## Contribution

The paper presents MINA, a novel automatic method for extracting minimum spans, enabling scalable and more accurate coreference evaluation across diverse datasets.

## Key findings

- MINA's extracted spans align well with expert annotations.
- Using minimum spans improves cross-dataset coreference evaluation.
- Automatic extraction reduces annotation costs significantly.

## Abstract

The common practice in coreference resolution is to identify and evaluate the maximum span of mentions. The use of maximum spans tangles coreference evaluation with the challenges of mention boundary detection like prepositional phrase attachment. To address this problem, minimum spans are manually annotated in smaller corpora. However, this additional annotation is costly and therefore, this solution does not scale to large corpora. In this paper, we propose the MINA algorithm for automatically extracting minimum spans to benefit from minimum span evaluation in all corpora. We show that the extracted minimum spans by MINA are consistent with those that are manually annotated by experts. Our experiments show that using minimum spans is in particular important in cross-dataset coreference evaluation, in which detected mention boundaries are noisier due to domain shift. We will integrate MINA into https://github.com/ns-moosavi/coval for reporting standard coreference scores based on both maximum and automatically detected minimum spans.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.06703/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/1906.06703/full.md

## References

22 references — full list in the complete paper: https://tomesphere.com/paper/1906.06703/full.md

---
Source: https://tomesphere.com/paper/1906.06703