Homogeneity in Regression

Tracy Ke; Jianqing Fan; Yichao Wu

arXiv:1303.7409·stat.ME·April 1, 2013·1 cites

Homogeneity in Regression

Tracy Ke, Jianqing Fan, Yichao Wu

PDF

Open Access

TL;DR

This paper introduces CARDS, a new clustering algorithm for high-dimensional regression that exploits coefficient homogeneity, improving estimation accuracy by leveraging low-dimensional structures like homogeneity and sparsity.

Contribution

The paper proposes a novel data-driven segmentation method called CARDS to explore homogeneity in regression coefficients, with theoretical analysis and practical applications.

Findings

01

CARDS achieves better estimation accuracy for homogeneous parameters.

02

The asymptotic normality of CARDS estimator is established.

03

Combining homogeneity and sparsity exploration enhances efficiency.

Abstract

This paper explores the homogeneity of coefficients in high-dimensional regression, which extends the sparsity concept and is more general and suitable for many applications. Homogeneity arises when one expects regression coefficients corresponding to neighboring geographical regions or a similar cluster of covariates to be approximately the same. Sparsity corresponds to a special case of homogeneity with a known atom zero. In this article, we propose a new method called clustering algorithm in regression via data-driven segmentation (CARDS) to explore homogeneity. New mathematics are provided on the gain that can be achieved by exploring homogeneity. Statistical properties of two versions of CARDS are analyzed. In particular, the asymptotic normality of our proposed CARDS estimator is established, which reveals better estimation accuracy for homogeneous parameters than that without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Gene expression and cancer classification