Exploring Machine Learning Regression Models for Advancing Foreground Mitigation and Global 21cm Signal Parameter Extraction
Anshuman Tripathi, Abhirup Datta, and Gursharanjit Kaur

TL;DR
This study evaluates machine learning regression models for extracting parameters from the global 21cm signal, demonstrating that Artificial Neural Networks outperform others, especially with PCA preprocessing, in accuracy and efficiency.
Contribution
The paper compares four machine learning regression models for 21cm signal analysis, highlighting the superior performance of ANNs and the benefits of PCA preprocessing.
Findings
ANN achieves lowest RMSE and highest R^2 scores.
PCA preprocessing improves model accuracy, especially with foreground contamination.
GPR is accurate but computationally intensive.
Abstract
Extracting parameters from the global 21cm signal is crucial for understanding the early Universe. However, detecting the 21cm signal is challenging due to the brighter foreground and associated observational difficulties. In this study, we evaluate the performance of various machine-learning regression models to improve parameter extraction and foreground removal. This evaluation is essential for selecting the most suitable machine learning regression model based on computational efficiency and predictive accuracy. We compare four models: Random Forest Regressor (RFR), Gaussian Process Regressor (GPR), Support Vector Regressor (SVR), and Artificial Neural Networks (ANN). The comparison is based on metrics such as the root mean square error (RMSE) and scores. We examine their effectiveness across different dataset sizes and conditions, including scenarios with foreground…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
