Cosmic Bandits: Exploration versus Exploitation in CMB B-Mode Experiments
Ely D. Kovetz, Marc Kamionkowski

TL;DR
This paper introduces adaptive survey strategies inspired by multi-armed bandit algorithms to optimize the detection of B-mode polarization in the CMB, effectively balancing foreground exploration and signal integration.
Contribution
It develops a novel adaptive approach for CMB B-mode experiments that improves sensitivity by dynamically selecting observation regions based on foreground estimates.
Findings
Adaptive strategies can improve upper bounds on tensor-to-scalar ratio by factors of 2-3.
Realistic foreground models demonstrate significant gains over traditional fixed surveys.
Techniques are applicable to various astrophysical surveys beyond CMB B-mode detection.
Abstract
A preferred method to detect the curl-component, or B-mode, signature of inflationary gravitational waves (IGWs) in the cosmic microwave background (CMB) polarization, in the absence of foregrounds and lensing, is a prolonged integration over a single patch of sky of a few square degrees. In practice, however, foregrounds abound and the sensitivity to B modes can be improved considerably by finding the region of sky cleanest of foregrounds. The best strategy to detect B modes thus involves a tradeoff between exploration (to find lower-foreground patches) and exploitation (through prolonged integration). This problem is akin to the multi-armed bandit (MAB) problem in probability theory, wherein a gambler faces a series of slot machines with unknown winning odds and must develop a strategy to maximize his/her winnings with some finite number of pulls. While the optimal MAB strategy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
