Multivariate Statistical Analysis of Exoplanet Habitability: Detection Bias and Earth Analog Identification
Caleb Traxler, Samuel Townsend, Abby Mori, Grace Newman, Kaitlyn Morenzone

TL;DR
This study uses multivariate statistical methods on 517 exoplanets to identify Earth-like candidates, revealing significant detection biases and emphasizing the rarity of truly habitable worlds while highlighting promising targets for future observation.
Contribution
It introduces a novel multivariate classification framework for exoplanet habitability and quantifies detection bias in current surveys.
Findings
Only 0.6% of exoplanets meet all habitability criteria.
Significant observational bias favors unsuitable planetary systems.
Earth is a statistical outlier among exoplanets.
Abstract
We present a comprehensive multivariate statistical analysis of 517 exoplanets from the NASA Exoplanet Archive to identify potentially habitable worlds and quantify detection bias in current surveys. Using eight key parameters (planetary radius, equilibrium temperature, insolation flux, density, and stellar effective temperature, radius, mass, metallicity), we developed a classification framework that successfully identifies Earth as an "Excellent Candidate" for habitability. Our analysis reveals that only 0.6% (3 planets including Earth) meet all habitability criteria under relaxed thresholds, while 75.0% exhibit "Good Star, Poor Planet" characteristics, indicating significant observational bias toward unsuitable planetary systems. Hotelling's T2 test demonstrates that potentially habitable planets are statistically significantly different from the general exoplanet population (p =…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
