DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization
Jeonghun Kang, Soonmok Kwon, Joonseok Lee, Byung-Hak Kim

TL;DR
DIAMOND is a hybrid LLM-driven system that enhances baseball highlight summarization by combining sabermetric analytics with natural language reasoning, significantly improving event detection accuracy.
Contribution
This paper introduces DIAMOND, a novel hybrid framework integrating sports analytics and LLMs for context-aware sports highlight summarization, surpassing existing methods.
Findings
F1-score improved from 42.9% to 84.8%.
Outperforms commercial and statistical baselines.
Effective in diverse baseball games.
Abstract
Traditional approaches -- such as Win Probability Added (WPA)-based ranking or computer vision-driven event detection -- can identify scoring plays but often miss strategic depth, momentum shifts, and storyline progression. Manual curation remains the gold standard but is resource-intensive and not scalable. We introduce DIAMOND, an LLM-driven agent for context-aware baseball highlight summarization that integrates structured sports analytics with natural language reasoning. DIAMOND leverages sabermetric features -- Win Expectancy, WPA, and Leverage Index -- to quantify play importance, while an LLM module enhances selection based on contextual narrative value. This hybrid approach ensures both quantitative rigor and qualitative richness, surpassing the limitations of purely statistical or vision-based systems. Evaluated on five diverse Korean Baseball Organization League games, DIAMOND…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Quality and Management · Topic Modeling · Time Series Analysis and Forecasting
