LCFO: Long Context and Long Form Output Dataset and Benchmarking
Marta R. Costa-juss\`a, Pierre Andrews, Mariano Coria Meglioli, Joy Chen, Joe Chuang, David Dale, Christophe Ropers, Alexandre Mourachko, Eduardo S\'anchez, Holger Schwenk, Tuan Tran, Arina Turkatenko, Carleigh Wood

TL;DR
This paper introduces LCFO, a comprehensive benchmark for evaluating long document summarization and expansion, including diverse domains, multiple summary lengths, QA alignments, and human and AI evaluation metrics.
Contribution
The paper presents LCFO, a novel dataset and benchmarking framework for long document summarization and expansion with multi-domain, multi-length summaries, and QA alignment.
Findings
GPT-4o-mini outperforms other models in summarization and expansion.
Automatic metrics show low correlation with human scores (~0.4).
Moderate correlation (~0.6) on fluency and attribution evaluations.
Abstract
This paper presents the Long Context and Form Output (LCFO) benchmark, a novel evaluation framework for assessing gradual summarization and summary expansion capabilities across diverse domains. LCFO consists of long input documents (5k words average length), each of which comes with three summaries of different lengths (20%, 10%, and 5% of the input text), as well as approximately 15 questions and answers (QA) related to the input content. Notably, LCFO also provides alignments between specific QA pairs and corresponding summaries in 7 domains. The primary motivation behind providing summaries of different lengths is to establish a controllable framework for generating long texts from shorter inputs, i.e. summary expansion. To establish an evaluation metric framework for summarization and summary expansion, we provide human evaluation scores for human-generated outputs, as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReservoir Engineering and Simulation Methods
