Bayesian analysis of flexible Heckman selection models using Hamiltonian Monte Carlo

Heeju Lim; Victor E. Lachos; Victor H. Lachos

arXiv:2510.20942·stat.ME·February 9, 2026

Bayesian analysis of flexible Heckman selection models using Hamiltonian Monte Carlo

Heeju Lim, Victor E. Lachos, Victor H. Lachos

PDF

TL;DR

This paper introduces a Bayesian approach to Heckman selection models that accounts for heavy-tailed error distributions using Hamiltonian Monte Carlo, improving robustness in econometric analysis.

Contribution

It extends Heckman models by replacing Gaussian errors with scale mixture distributions and implements the approach in Stan, enabling more flexible and robust inference.

Findings

01

Models with Student's-t and contaminated normal errors outperform Gaussian models in heavy-tailed scenarios.

02

The methodology is validated through simulation studies and real-world data applications.

03

The approach is implemented in the R package HeckmanStan for accessible use.

Abstract

The Heckman selection model is widely used in econometric analysis and other social sciences to address sample selection bias in data modeling. A common assumption in Heckman selection models is that the error terms follow an independent bivariate normal distribution. However, real-world data often deviates from this assumption, exhibiting heavy-tailed behavior, which can lead to inconsistent estimates if not properly addressed. In this paper, we propose a Bayesian analysis of Heckman selection models that replace the Gaussian assumption with well-known members of the class of scale mixture of normal distributions, such as the Student's-t and contaminated normal distributions. For these complex structures, Stan's default No-U-Turn sampler is utilized to obtain posterior simulations. Through extensive simulation studies, we compare the performance of the Heckman selection models with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.