Random forest automated supervised classification of Hipparcos periodic   variable stars

P. Dubath; L. Rimoldini; M. S\"uveges; J. Blomme; M. L\'opez; L. M.; Sarro; J. De Ridder; J. Cuypers; L. Guy; I. Lecoeur; K. Nienartowicz; A. Jan,; M. Beck; N. Mowlavi; P. De Cat; T. Lebzelter; L. Eyer

arXiv:1101.2406·astro-ph.SR·May 27, 2015

Random forest automated supervised classification of Hipparcos periodic variable stars

P. Dubath, L. Rimoldini, M. S\"uveges, J. Blomme, M. L\'opez, L. M., Sarro, J. De Ridder, J. Cuypers, L. Guy, I. Lecoeur, K. Nienartowicz, A. Jan,, M. Beck, N. Mowlavi, P. De Cat, T. Lebzelter, L. Eyer

PDF

TL;DR

This paper evaluates the effectiveness of using random forest algorithms for automated classification of Hipparcos periodic variable stars into 26 types, achieving high accuracy with 90-100% correct classification rates.

Contribution

It introduces a supervised classification approach utilizing random forests and a multi-stage scheme, identifying key attributes for accurate star type classification.

Findings

01

Achieved 90-100% correct classification accuracy.

02

Identified key attributes like period, amplitude, and colour index for classification.

03

Provided online dataset of training and predicted star types.

Abstract

We present an evaluation of the performance of an automated classification of the Hipparcos periodic variable stars into 26 types. The sub-sample with the most reliable variability types available in the literature is used to train supervised algorithms to characterize the type dependencies on a number of attributes. The most useful attributes evaluated with the random forest methodology include, in decreasing order of importance, the period, the amplitude, the V-I colour index, the absolute magnitude, the residual around the folded light-curve model, the magnitude distribution skewness and the amplitude of the second harmonic of the Fourier series model relative to that of the fundamental frequency. Random forests and a multi-stage scheme involving Bayesian network and Gaussian mixture methods lead to statistically equivalent results. In standard 10-fold cross-validation experiments,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.