Least Squares Estimation For Hierarchical Data
Ryan Cumings-Menon, Pavel Zhuravlev

TL;DR
This paper presents an efficient algorithm for high-dimensional least squares estimation leveraging hierarchical data structures, applied to Census Bureau's noisy 2020 Census measurements to assess uncertainty.
Contribution
It introduces a computationally efficient algorithm for hierarchical least squares estimation and demonstrates its application to Census data for uncertainty quantification.
Findings
Algorithm computes high-dimensional least squares estimates efficiently.
Estimator's output matches the generalized least squares estimator.
Provides confidence intervals for Census tabulations using the estimator.
Abstract
The U.S. Census Bureau's 2020 Disclosure Avoidance System (DAS) bases its output on noisy measurements, which are population tabulations added to realizations of mean-zero random variables. These noisy measurements are observed for a set of hierarchical geographic levels, e.g., the U.S. as a whole, states, counties, census tracts, and census blocks. The Census Bureau released the noisy measurements generated in the DAS executions for the two primary 2020 Census data products, in part to allow data users to assess uncertainty in 2020 Census tabulations introduced by disclosure avoidance. This paper describes an algorithm that can leverage the hierarchical structure of the input data in order to compute very high dimensional least squares estimates in a computationally efficient manner. Afterward, we show that this algorithm's output is equal to the generalized least squares estimator,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
