3M: Multi-modal Multi-task Multi-teacher Learning for Game Event Detection
Thye Shan Ng, Feiqi Cao, Soyeon Caren Han

TL;DR
This paper introduces a multi-modal, multi-task, multi-teacher learning framework for detecting game events in esports, integrating diverse data sources to improve understanding of complex gameplay situations.
Contribution
It proposes a novel multi-teacher framework that independently trains on different tasks, enhancing game event detection beyond conventional multi-modal models.
Findings
Effective in improving game event detection accuracy
Demonstrates advantages over traditional single-teacher models
Enhances comprehension of complex esports gameplay
Abstract
Esports has rapidly emerged as a global phenomenon with an ever-expanding audience via platforms, like YouTube. Due to the inherent complexity nature of the game, it is challenging for newcomers to comprehend what the event entails. The chaotic nature of online chat, the fast-paced speech of the game commentator, and the game-specific user interface further compound the difficulty for users in comprehending the gameplay. To overcome these challenges, it is crucial to integrate the Multi-Modal (MM) information from the platform and understand the event. The paper introduces a new MM multi-teacher-based game event detection framework, with the ultimate goal of constructing a comprehensive framework that enhances the comprehension of the ongoing game situation. While conventional MM models typically prioritise aligning MM data through concurrent training towards a unified objective, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Artificial Intelligence in Games · Digital Games and Media
