Breaking the Dimensional Barrier: Dynamic Portfolio Choice with Parameter Uncertainty via Pontryagin Projection
Jeonggyu Huh, Hyeng Keun Koo

TL;DR
This paper introduces a novel two-stage simulation-based method for dynamic portfolio optimization under parameter uncertainty, combining stochastic gradient ascent with a Pontryagin projection to improve stability and accuracy.
Contribution
It develops PG-DPO, a new approach integrating BPTT-based optimization with Pontryagin projection, to handle latent parameter uncertainty in continuous-time portfolio choice.
Findings
Projection stabilizes learning in high-dimensional settings
Accurately recovers analytic decision-time references
Outperforms model-free PPO baseline in experiments
Abstract
We study continuous-time CRRA portfolio choice in diffusion markets with estimated and hence uncertain coefficients. Nature draws a latent parameter at time and keeps it fixed; the investor never observes and must commit to a single -blind policy maximizing an ex-ante objective, treating as a decision-time input. We propose a simulation-only two-stage solver.Stage 1 (DPO) performs BPTT-based stochastic gradient ascent through an Euler simulator while sampling only inside the simulator. Stage 2 (Pontryagin projection) aggregates costate blocks across and enforces the -aggregated stationarity condition within the deployable class; the resulting correction can be amortized via interactive distillation. We refer to the full Stage 1 + Stage 2 pipeline as PG-DPO.We prove a uniform conditional BPTT-PMP correspondence and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Stochastic Gradient Optimization Techniques · Risk and Portfolio Optimization
