Temporal expression normalisation in natural language texts

Michele Filannino

arXiv:1206.2010·cs.CL·June 12, 2012

Temporal expression normalisation in natural language texts

Michele Filannino

PDF

TL;DR

This paper presents a new rule-based system for normalizing temporal expressions in English texts, outperforming existing methods and providing a new annotated corpus for research.

Contribution

A novel rule-based architecture for temporal expression normalization, along with a new annotated corpus, advancing the state-of-the-art in temporal information extraction.

Findings

01

Outperforms TempEval-2 shared task systems

02

Achieves better results than previous systems

03

Provides a publicly available annotated corpus

Abstract

Automatic annotation of temporal expressions is a research challenge of great interest in the field of information extraction. In this report, I describe a novel rule-based architecture, built on top of a pre-existing system, which is able to normalise temporal expressions detected in English texts. Gold standard temporally-annotated resources are limited in size and this makes research difficult. The proposed system outperforms the state-of-the-art systems with respect to TempEval-2 Shared Task (value attribute) and achieves substantially better results with respect to the pre-existing system on top of which it has been developed. I will also introduce a new free corpus consisting of 2822 unique annotated temporal expressions. Both the corpus and the system are freely available on-line.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.