Adaptive Pattern Extraction Multi-Task Learning for Multi-Step Conversion Estimations
Xuewen Tao, Mingming Ha, Xiaobo Guo, Qiongxu Ma, Hongwei, Cheng, Wenfang Lin

TL;DR
This paper introduces APEM, an adaptive multi-task learning framework that enhances feature representation and task-specific learning, significantly improving multi-step conversion estimations in industrial applications.
Contribution
The paper proposes a novel APEM framework with a DeepAuto Group Transformer and Pattern Selector for adaptive, fine-grained feature learning in multi-task settings.
Findings
Outperforms state-of-the-art MTL methods on public datasets
Demonstrates superior online performance in industrial recommendation systems
Effectively balances task relationships to improve overall accuracy
Abstract
Multi-task learning (MTL) has been successfully used in many real-world applications, which aims to simultaneously solve multiple tasks with a single model. The general idea of multi-task learning is designing kinds of global parameter sharing mechanism and task-specific feature extractor to improve the performance of all tasks. However, challenge still remains in balancing the trade-off of various tasks since model performance is sensitive to the relationships between them. Less correlated or even conflict tasks will deteriorate the performance by introducing unhelpful or negative information. Therefore, it is important to efficiently exploit and learn fine-grained feature representation corresponding to each task. In this paper, we propose an Adaptive Pattern Extraction Multi-task (APEM) framework, which is adaptive and flexible for large-scale industrial application. APEM is able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Text and Document Classification Technologies · Machine Learning and ELM
MethodsAttention Is All You Need · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Multi-Head Attention · Absolute Position Encodings · Dropout · Label Smoothing
