Portfolio Optimization Proxies under Label Scarcity and Regime Shifts via Bayesian and Deterministic Students under Semi-Supervised Sandwich Training
Adhiraj Chattopadhyay

TL;DR
This paper introduces a machine learning framework for portfolio optimization in low-data and regime-shift environments, using a teacher-student model with synthetic data augmentation to improve robustness and stability.
Contribution
It presents a novel semi-supervised training approach combining Bayesian and deterministic neural models with CVaR optimization labels for portfolio management.
Findings
Student models match or outperform the CVaR teacher in various settings.
Models show improved robustness under regime shifts.
Reduced turnover achieved with hybrid learning approaches.
Abstract
This paper proposes a machine learning assisted portfolio optimization framework designed for low data environments and regime uncertainty. We construct a teacher student learning pipeline in which a Conditional Value at Risk (CVaR) optimizer generates supervisory labels, and neural models (Bayesian and deterministic) are trained using both real and synthetically augmented data. The synthetic data is generated using a factor based model with t copula residuals, enabling training beyond the limited real sample of 104 labeled observations. We evaluate four student models under a structured experimental framework comprising (i) controlled synthetic experiments (3 x 5 seed grid), (ii) in-distribution real market evaluation (C2A) and (iii) cross-universe generalization (D2A). In real-market settings, models are deployed using a rolling evaluation protocol where a frozen pretrained model is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
