DEPHN: Different Expression Parallel Heterogeneous Network using virtual gradient optimization for Multi-task Learning
Menglin Kong, Ri Su, Shaojie Zhao, Muzhou Hou

TL;DR
DEPHN introduces a novel multi-task learning network that models complex task relationships using diverse feature interactions and adaptive gradient optimization, enhancing recommendation system performance.
Contribution
The paper proposes DEPHN, a new multi-task learning model with heterogeneous experts and virtual gradient optimization to better capture complex task correlations.
Findings
DEPHN outperforms baseline models on artificial and real-world datasets.
The model effectively captures complex task relationships.
Adaptive gradient optimization improves learning efficiency.
Abstract
Recommendation system algorithm based on multi-task learning (MTL) is the major method for Internet operators to understand users and predict their behaviors in the multi-behavior scenario of platform. Task correlation is an important consideration of MTL goals, traditional models use shared-bottom models and gating experts to realize shared representation learning and information differentiation. However, The relationship between real-world tasks is often more complex than existing methods do not handle properly sharing information. In this paper, we propose an Different Expression Parallel Heterogeneous Network (DEPHN) to model multiple tasks simultaneously. DEPHN constructs the experts at the bottom of the model by using different feature interaction methods to improve the generalization ability of the shared information flow. In view of the model's differentiating ability for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
