# Lexical Features in Coreference Resolution: To be Used With Caution

**Authors:** Nafise Sadat Moosavi, Michael Strube

arXiv: 1704.06779 · 2017-04-25

## TL;DR

This paper critically examines the reliance on lexical features in coreference resolution, highlighting issues with generalization to unseen domains and flaws in current evaluation practices.

## Contribution

It reveals the limitations of lexical features in domain generalization and exposes evaluation flaws due to dataset overlaps in coreference resolution.

## Key findings

- Heavy reliance on lexical features hampers domain adaptation.
- Current evaluation datasets have significant overlaps, skewing results.
- Lexical features should be used cautiously in coreference models.

## Abstract

Lexical features are a major source of information in state-of-the-art coreference resolvers. Lexical features implicitly model some of the linguistic phenomena at a fine granularity level. They are especially useful for representing the context of mentions. In this paper we investigate a drawback of using many lexical features in state-of-the-art coreference resolvers. We show that if coreference resolvers mainly rely on lexical features, they can hardly generalize to unseen domains. Furthermore, we show that the current coreference resolution evaluation is clearly flawed by only evaluating on a specific split of a specific dataset in which there is a notable overlap between the training, development and test sets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.06779/full.md

## References

13 references — full list in the complete paper: https://tomesphere.com/paper/1704.06779/full.md

---
Source: https://tomesphere.com/paper/1704.06779