GeSERA: General-domain Summary Evaluation by Relevance Analysis

Jessica L\'opez Espejel; Ga\"el de Chalendar; Jorge Garcia Flores,; Thierry Charnois; Ivan Vladimir Meza Ruiz

arXiv:2110.03567·cs.CL·October 8, 2021

GeSERA: General-domain Summary Evaluation by Relevance Analysis

Jessica L\'opez Espejel, Ga\"el de Chalendar, Jorge Garcia Flores,, Thierry Charnois, Ivan Vladimir Meza Ruiz

PDF

Open Access

TL;DR

GeSERA is an open-source evaluation method that adapts SERA for the general domain, improving correlation with manual assessments and outperforming ROUGE in some cases.

Contribution

This paper extends SERA from biomedical to general domain by enhancing query reformulation and replacing the index, achieving better correlation with manual evaluations.

Findings

01

GeSERA outperforms SERA in most cases for general summaries.

02

GeSERA surpasses ROUGE in two TAC2009 cases.

03

Index size and human annotations significantly impact evaluation results.

Abstract

We present GeSERA, an open-source improved version of SERA for evaluating automatic extractive and abstractive summaries from the general domain. SERA is based on a search engine that compares candidate and reference summaries (called queries) against an information retrieval document base (called index). SERA was originally designed for the biomedical domain only, where it showed a better correlation with manual methods than the widely used lexical-based ROUGE method. In this paper, we take out SERA from the biomedical domain to the general one by adapting its content-based method to successfully evaluate summaries from the general domain. First, we improve the query reformulation strategy with POS Tags analysis of general-domain corpora. Second, we replace the biomedical index used in SERA with two article collections from AQUAINT-2 and Wikipedia. We conduct experiments with TAC2008,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Sentiment Analysis and Opinion Mining