On sample complexity for covariance estimation via the unadjusted Langevin algorithm

Shogo Nakakita

arXiv:2601.21717·math.PR·February 16, 2026

On sample complexity for covariance estimation via the unadjusted Langevin algorithm

Shogo Nakakita

PDF

Open Access

TL;DR

This paper provides theoretical guarantees on the number of samples needed for covariance estimation using the unadjusted Langevin algorithm, highlighting efficiency differences between single-chain and parallel approaches.

Contribution

It introduces new sample complexity bounds for covariance estimation with ULA and compares single-chain versus parallel implementations, emphasizing bias reduction effects.

Findings

01

Single-chain ULA has lower sample complexity than embarrassingly parallel ULA by a logarithmic factor.

02

A concentration bound for the sample covariance matrix is derived using a log-Sobolev inequality.

03

The results quantify the efficiency of ULA in covariance estimation for strongly log-concave distributions.

Abstract

We establish sample complexity guarantees for estimating the covariance matrix of a strongly log-concave smooth distribution using the unadjusted Langevin algorithm (ULA). We quantitatively compare our complexity estimates on single-chain ULA with embarrassingly parallel ULA and derive that the sample complexity of the single-chain approach is smaller than that of embarrassingly parallel ULA by a logarithmic factor in the dimension and the reciprocal of the prescribed precision, with the difference arising from effective bias reduction through burn-in. The key technical contribution is a concentration bound for the sample covariance matrix around its expectation, derived via a log-Sobolev inequality for the joint distribution of ULA iterates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Random Matrices and Applications · Sparse and Compressive Sensing Techniques