Portfolio Optimization Proxies under Label Scarcity and Regime Shifts via Bayesian and Deterministic Students under Semi-Supervised Sandwich Training

Adhiraj Chattopadhyay

arXiv:2604.14206·cs.LG·April 17, 2026

Portfolio Optimization Proxies under Label Scarcity and Regime Shifts via Bayesian and Deterministic Students under Semi-Supervised Sandwich Training

Adhiraj Chattopadhyay

PDF

TL;DR

This paper introduces a machine learning framework for portfolio optimization in low-data and regime-shift environments, using a teacher-student model with synthetic data augmentation to improve robustness and stability.

Contribution

It presents a novel semi-supervised training approach combining Bayesian and deterministic neural models with CVaR optimization labels for portfolio management.

Findings

01

Student models match or outperform the CVaR teacher in various settings.

02

Models show improved robustness under regime shifts.

03

Reduced turnover achieved with hybrid learning approaches.

Abstract

This paper proposes a machine learning assisted portfolio optimization framework designed for low data environments and regime uncertainty. We construct a teacher student learning pipeline in which a Conditional Value at Risk (CVaR) optimizer generates supervisory labels, and neural models (Bayesian and deterministic) are trained using both real and synthetically augmented data. The synthetic data is generated using a factor based model with t copula residuals, enabling training beyond the limited real sample of 104 labeled observations. We evaluate four student models under a structured experimental framework comprising (i) controlled synthetic experiments (3 x 5 seed grid), (ii) in-distribution real market evaluation (C2A) and (iii) cross-universe generalization (D2A). In real-market settings, models are deployed using a rolling evaluation protocol where a frozen pretrained model is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.