tieval: An Evaluation Framework for Temporal Information Extraction Systems
Hugo Sousa, Al\'ipio Jorge, Ricardo Campos

TL;DR
Tieval is a Python library designed to standardize and simplify the evaluation of temporal information extraction systems across diverse datasets, addressing issues of comparability and evaluation metrics.
Contribution
It introduces a unified framework that handles different annotation schemes and formats, enabling fairer benchmarking of TIE systems.
Findings
Facilitates importing multiple corpora with different formats
Supports traditional and temporal-aware evaluation metrics
Enhances comparability of TIE system performance
Abstract
Temporal information extraction (TIE) has attracted a great deal of interest over the last two decades, leading to the development of a significant number of datasets. Despite its benefits, having access to a large volume of corpora makes it difficult when it comes to benchmark TIE systems. On the one hand, different datasets have different annotation schemes, thus hindering the comparison between competitors across different corpora. On the other hand, the fact that each corpus is commonly disseminated in a different format requires a considerable engineering effort for a researcher/practitioner to develop parsers for all of them. This constraint forces researchers to select a limited amount of datasets to evaluate their systems which consequently limits the comparability of the systems. Yet another obstacle that hinders the comparability of the TIE systems is the evaluation metric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Semantic Web and Ontologies
MethodsLib
