Stacking for machine learning redshifts applied to SDSS galaxies
Roman Zitlau, Ben Hoyle, Kerstin Paech, Jochen Weller, Markus Michael, Rau, Stella Seitz

TL;DR
This paper demonstrates that stacking machine learning models, including both weak and strong learners, significantly improves photometric redshift estimation for SDSS galaxies with minimal additional computational cost.
Contribution
It introduces and evaluates stacking architectures for photometric redshift estimation, showing consistent improvements across various algorithms and configurations.
Findings
Stacking improves weak learner performance by 1.9% to 21%.
Stacking yields 0.4% to 2.5% improvement for strong learners like AdaBoost.
Stacking benefits are achieved with minimal extra computational effort.
Abstract
We present an analysis of a general machine learning technique called 'stacking' for the estimation of photometric redshifts. Stacking techniques can feed the photometric redshift estimate, as output by a base algorithm, back into the same algorithm as an additional input feature in a subsequent learning round. We shown how all tested base algorithms benefit from at least one additional stacking round (or layer). To demonstrate the benefit of stacking, we apply the method to both unsupervised machine learning techniques based on self-organising maps (SOMs), and supervised machine learning methods based on decision trees. We explore a range of stacking architectures, such as the number of layers and the number of base learners per layer. Finally we explore the effectiveness of stacking even when using a successful algorithm such as AdaBoost. We observe a significant improvement of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
