Gameplay Highlights Generation
Vignesh Edithal, Le Zhang, Ilia Blank, Imran Junejo

TL;DR
This paper presents a multimodal video understanding approach for automatically generating gaming highlight reels by detecting interesting events, using a fine-tuned X-CLIP model that generalizes across multiple games and platforms.
Contribution
The work introduces a novel dataset and a fine-tuned multimodal model for cross-game highlight detection without game-specific engineering, improving efficiency and generalization.
Findings
Achieved over 90% accuracy in event detection in unseen gameplay footage.
Demonstrated transfer learning benefits for low-resource games.
Enabled cross-platform deployment with ONNX and quantization.
Abstract
In this work, we enable gamers to share their gaming experience on social media by automatically generating eye-catching highlight reels from their gameplay session Our automation will save time for gamers while increasing audience engagement. We approach the highlight generation problem by first identifying intervals in the video where interesting events occur and then concatenate them. We developed an in-house gameplay event detection dataset containing interesting events annotated by humans using VIA video annotator. Traditional techniques for highlight detection such as game engine integration requires expensive collaboration with game developers. OCR techniques which detect patches of specific images or texts require expensive per game engineering and may not generalize across game UI and different language. We finetuned a multimodal general purpose video understanding model such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Video Analysis and Summarization · Digital Games and Media
