Pareto-guided Pipeline for Distilling Featherweight AI Agents in Mobile MOBA Games
Xionghui Yang, Bozhou Chen, Yunlong Lu, Yongyi Wang, Lingfeng Li, Lanxiao Huang, Lin Liu, Wenjun Wang, Meng Meng, Xia Lin, Wenxin Li

TL;DR
This paper presents a Pareto-guided pipeline for distilling large game AI models into lightweight, efficient agents suitable for mobile MOBA games, balancing performance and resource constraints.
Contribution
It introduces a systematic pipeline and a tailored search space for creating mobile-friendly AI agents that maintain high performance while significantly improving efficiency.
Findings
12.4x faster inference speed
15.6x better energy efficiency
40.32% win rate retention
Abstract
Recent advances in game AI have demonstrated the feasibility of training agents that surpass top-tier human professionals in complex environments such as Honor of Kings (HoK), a leading mobile multiplayer online battle arena (MOBA) game. However, deploying such powerful agents on mobile devices remains a major challenge. On one hand, the intricate multi-modal state representation and hierarchical action space of HoK demand large, sophisticated policy networks that are inherently difficult to compress into lightweight forms. On the other hand, production deployment requires high-frequency inference under strict energy and latency constraints on mobile platform. To the best of our knowledge, bridging large-scale game AI and practical on-device deployment has not been systematically studied. In this work, we propose a Pareto optimality guided pipeline and design a high-efficiency student…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Reinforcement Learning in Robotics · IoT and Edge/Fog Computing
