Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos Understanding
Avijit Shah, Topojoy Biswas, Sathish Ramadoss, Deven Santosh Shah

TL;DR
This paper introduces a novel distant supervision method for accurately detecting and recognizing semantic text in sports clocks within broadcast sports videos, addressing unique challenges and enabling better understanding of sports events.
Contribution
It proposes a new distant supervision technique to automatically create sports clock datasets and combines it with data augmentation and state-of-the-art models for improved text detection and recognition.
Findings
Achieved high accuracy in semantic text detection and recognition in sports clocks
Developed a scalable computational pipeline for industrial deployment
Created a robust dataset for sports clock text understanding
Abstract
Comprehensive understanding of key players and actions in multiplayer sports broadcast videos is a challenging problem. Unlike in news or finance videos, sports videos have limited text. While both action recognition for multiplayer sports and detection of players has seen robust research, understanding contextual text in video frames still remains one of the most impactful avenues of sports video understanding. In this work we study extremely accurate semantic text detection and recognition in sports clocks, and challenges therein. We observe unique properties of sports clocks, which makes it hard to utilize general-purpose pre-trained detectors and recognizers, so that text can be accurately understood to the degree of being used to align to external knowledge. We propose a novel distant supervision technique to automatically build sports clock datasets. Along with suitable data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
