ORCA: Mitigating Over-Reliance for Multi-Task Dwell Time Prediction with Causal Decoupling

Huishi Luo; Fuzhen Zhuang; Yongchun Zhu; Yiqing Wu; Bo Kang; Ruobing Xie; Feng Xia; Deqing Wang; Jin Dong

arXiv:2508.16573·cs.IR·August 25, 2025

ORCA: Mitigating Over-Reliance for Multi-Task Dwell Time Prediction with Causal Decoupling

Huishi Luo, Fuzhen Zhuang, Yongchun Zhu, Yiqing Wu, Bo Kang, Ruobing Xie, Feng Xia, Deqing Wang, Jin Dong

PDF

TL;DR

This paper introduces ORCA, a causal decoupling method that improves multi-task dwell time prediction by reducing over-reliance on CTR-DT correlation, leading to more accurate moderate-duration predictions.

Contribution

ORCA is a novel causal decoupling approach that explicitly models and subtracts CTR's negative transfer, enhancing dwell time prediction in recommender systems.

Findings

01

10.6% average improvement in dwell time metrics

02

Preserves CTR performance while improving DT predictions

03

Model-agnostic and easy to deploy

Abstract

Dwell time (DT) is a critical post-click metric for evaluating user preference in recommender systems, complementing the traditional click-through rate (CTR). Although multi-task learning is widely adopted to jointly optimize DT and CTR, we observe that multi-task models systematically collapse their DT predictions to the shortest and longest bins, under-predicting the moderate durations. We attribute this moderate-duration bin under-representation to over-reliance on the CTR-DT spurious correlation, and propose ORCA to address it with causal-decoupling. Specifically, ORCA explicitly models and subtracts CTR's negative transfer while preserving its positive transfer. We further introduce (i) feature-level counterfactual intervention, and (ii) a task-interaction module with instance inverse-weighting, weakening CTR-mediated effect and restoring direct DT semantics. ORCA is model-agnostic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.