Comparing two samples by penalized logistic regression

Konstantinos Fokianos

arXiv:0807.2563·math.ST·July 17, 2008

Comparing two samples by penalized logistic regression

Konstantinos Fokianos

PDF

TL;DR

This paper introduces a penalized density ratio model for comparing multiple samples, demonstrating improved estimator accuracy and reliable testing of distribution equality, especially in small samples, supported by theoretical analysis and simulations.

Contribution

It extends the density ratio model with penalization, reducing mean square error and resolving estimator existence issues, enhancing distribution comparison methods.

Findings

01

Penalized estimators have lower mean square error than maximum likelihood estimators.

02

Penalization resolves estimator existence problems.

03

Simulation studies support theoretical advantages.

Abstract

Inference based on the penalized density ratio model is proposed and studied. The model under consideration is specified by assuming that the log--likelihood function of two unknown densities is of some parametric form. The model has been extended to cover multiple samples problems while its theoretical properties have been investigated using large sample theory. A main application of the density ratio model is testing whether two, or more, distributions are equal. We extend these results by arguing that the penalized maximum empirical likelihood estimator has less mean square error than that of the ordinary maximum likelihood estimator, especially for small samples. In fact, penalization resolves any existence problems of estimators and a modified Wald type test statistic can be employed for testing equality of the two distributions. A limited simulation study supports further the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.