Examining the Metrics for Document-Level Claim Extraction in Czech and Slovak
Lucia Makaiova, Martin Fajcik, Antonin Jarolim

TL;DR
This paper investigates methods for aligning and evaluating document-level claim extraction in Czech and Slovak, highlighting current limitations and proposing the need for more advanced semantic similarity measures.
Contribution
It introduces an alignment-based evaluation framework for claim extraction in Czech and Slovak, addressing challenges posed by informal language and local context.
Findings
Current evaluation methods have limitations in capturing semantic similarity.
Informal language and local context complicate claim extraction in Czech and Slovak.
More advanced evaluation methods are needed for reliable claim assessment.
Abstract
Document-level claim extraction remains an open challenge in the field of fact-checking, and subsequently, methods for evaluating extracted claims have received limited attention. In this work, we explore approaches to aligning two sets of claims pertaining to the same source document and computing their similarity through an alignment score. We investigate techniques to identify the best possible alignment and evaluation method between claim sets, with the aim of providing a reliable evaluation framework. Our approach enables comparison between model-extracted and human-annotated claim sets, serving as a metric for assessing the extraction performance of models and also as a possible measure of inter-annotator agreement. We conduct experiments on newly collected dataset-claims extracted from comments under Czech and Slovak news articles-domains that pose additional challenges due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
