Theory of Mixture-of-Experts for Mobile Edge Computing
Hongbo Li, Lingjie Duan

TL;DR
This paper introduces a mixture-of-experts framework tailored for mobile edge computing to improve continual learning by dynamically routing tasks to specialized servers, reducing generalization error over time.
Contribution
It pioneers the application of MoE theory in MEC networks for online task streams, with an adaptive gating network and theoretical analysis of expert requirements.
Findings
MoE reduces generalization error over time in MEC.
Optimal number of experts ensures convergence and task specialization.
Adding excess experts can delay convergence and increase error.
Abstract
In mobile edge computing (MEC) networks, mobile users generate diverse machine learning tasks dynamically over time. These tasks are typically offloaded to the nearest available edge server, by considering communication and computational efficiency. However, its operation does not ensure that each server specializes in a specific type of tasks and leads to severe overfitting or catastrophic forgetting of previous tasks. To improve the continual learning (CL) performance of online tasks, we are the first to introduce mixture-of-experts (MoE) theory in MEC networks and save MEC operation from the increasing generalization error over time. Our MoE theory treats each MEC server as an expert and dynamically adapts to changes in server availability by considering data transfer and computation time. Unlike existing MoE models designed for offline tasks, ours is tailored for handling continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis
MethodsMixture of Experts
