Knowledge Enhanced Sports Game Summarization
Jiaan Wang, Zhixu Li, Tingyi Zhang, Duo Zheng, Jianfeng Qu, An Liu,, Lei Zhao, Zhigang Chen

TL;DR
This paper introduces K-SportsSum, a high-quality dataset with a large knowledge base and a knowledge-enhanced summarization model that significantly improves sports news generation from live commentaries.
Contribution
The paper presents a new dataset with manual cleaning and a large knowledge base, along with a novel knowledge-enhanced summarizer for sports news generation.
Findings
Achieves state-of-the-art performance on K-SportsSum and SportsSum datasets.
Produces more informative and accurate sports news according to human evaluation.
Demonstrates the effectiveness of integrating knowledge into sports summarization.
Abstract
Sports game summarization aims at generating sports news from live commentaries. However, existing datasets are all constructed through automated collection and cleaning processes, resulting in a lot of noise. Besides, current works neglect the knowledge gap between live commentaries and sports news, which limits the performance of sports game summarization. In this paper, we introduce K-SportsSum, a new dataset with two characteristics: (1) K-SportsSum collects a large amount of data from massive games. It has 7,854 commentary-news pairs. To improve the quality, K-SportsSum employs a manual cleaning process; (2) Different from existing datasets, to narrow the knowledge gap, K-SportsSum further provides a large-scale knowledge corpus that contains the information of 523 sports teams and 14,724 sports players. Additionally, we also introduce a knowledge-enhanced summarizer that utilizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Topic Modeling · Natural Language Processing Techniques
