Adaptive Estimation and Inference in Semi-parametric Heterogeneous Clustered Multitask Learning via Neyman Orthogonality
Hanxiao Chen, Debarghya Mukherjee

TL;DR
This paper introduces an adaptive orthogonal estimator for semi-parametric clustered multitask learning that accurately recovers latent clusters and achieves optimal convergence rates.
Contribution
It develops a novel fused orthogonal estimator combining Neyman-orthogonal losses with data-driven fusion penalties for heterogeneous tasks.
Findings
Achieves exact recovery of latent clusters with high probability.
Attains pooled parametric convergence rates proportional to cluster size.
Outperforms strong baselines in simulations and real energy consumption data.
Abstract
We study clustered multitask learning in a semiparametric setting where tasks share a latent cluster structure in their target parameters but exhibit heterogeneous, potentially infinite-dimensional nuisance components. Such heterogeneity poses a major challenge for existing multitask learning methods, which typically rely on aligned feature spaces or homogeneous task structures. To address this challenge, we propose an adaptive fused orthogonal estimator that integrates Neyman-orthogonal losses with data-driven pairwise fusion penalties. Our framework leverages task-specific pilot estimates to calibrate the fusion penalties and combines adaptive aggregation with orthogonalization to mitigate the impact of nuisance-parameter estimation error. Theoretically, we show that the proposed estimator achieves exact recovery of the latent clustering with high probability and attains pooled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
