SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs

Nitish Kumar; Sannu Kumar; S Akash; Manish Gupta; Ankith Karat; Sriparna Saha

arXiv:2604.04947·cs.IR·April 8, 2026

SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs

Nitish Kumar, Sannu Kumar, S Akash, Manish Gupta, Ankith Karat, Sriparna Saha

PDF

1 Repo

TL;DR

This paper introduces SUMMIR, a framework that ranks sports insights generated by LLMs, ensuring factual accuracy and relevance, and provides a dataset and evaluation methodology for sports news insights.

Contribution

It presents a novel ranking architecture for sports insights, a curated dataset, and a comprehensive evaluation pipeline for factual consistency and relevance of LLM outputs.

Findings

01

SUMMIR effectively ranks relevant sports insights.

02

Significant differences in factual consistency across LLMs.

03

The dataset covers 7,900 articles across four major sports.

Abstract

With the rapid proliferation of online sports journalism, extracting meaningful pre-game and post-game insights from articles is essential for enhancing user engagement and comprehension. In this paper, we address the task of automatically extracting such insights from articles published before and after matches. We curate a dataset of 7,900 news articles covering 800 matches across four major sports: Cricket, Soccer, Basketball, and Baseball. To ensure contextual relevance, we employ a two-step validation pipeline leveraging both open-source and proprietary large language models (LLMs). We then utilize multiple state-of-the-art LLMs (GPT-4o, Qwen2.5-72B-Instruct, Llama-3.3-70B-Instruct, and Mixtral-8x7B-Instruct-v0.1) to generate comprehensive insights. The factual accuracy of these outputs is rigorously assessed using a FactScore-based methodology, complemented by hallucination…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nitish-iitp/SUMMIR
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.