TL;DR
SAGA is a transformer-based model with conformal calibration for probabilistic long-term earnings forecasting, outperforming traditional parametric models and providing reliable prediction intervals.
Contribution
The paper introduces SAGA, a novel sequence-adaptive generative architecture with conformal prediction for accurate, long-horizon earnings forecasts with guaranteed coverage.
Findings
SAGA reduces CRPS by 31.9% at 10-year horizon.
SAGA decreases MAE by 37.7% at 20-year horizon.
Conformal intervals achieve near-nominal coverage, within 0.4 percentage points.
Abstract
Microsimulation models used by ministries of finance and central banks rely on parametric processes for lifetime earnings that capture only first and second moments of the conditional distribution and miss long-range nonlinear structure. We propose SAGA, a decoder-only transformer for irregular tabular panel sequences, paired with a split conformal calibration wrapper that delivers individual-level prediction intervals with finite-sample marginal coverage guarantees. Trained on the longitudinal Swedish LISA register over 1990 to 2022, comprising 2,143,817 individuals and 61,284,903 person-years, the model forecasts annual labor earnings at horizons of one to thirty years and aggregates them by Monte Carlo into present-discounted lifetime earnings distributions. Against the canonical Guvenen, Karahan, Ozkan, and Song parametric process and tabular and recurrent baselines, SAGA reduces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
