Optimizing measurement-based cooling by reinforcement learning
Jia-shun Yan, Jun Jing

TL;DR
This paper introduces an optimized measurement-based cooling method using reinforcement learning, achieving significant population reduction in a target resonator with a high success probability by combining conditional and unconditional strategies.
Contribution
It derives an analytical optimal measurement interval and develops a reinforcement learning algorithm to maximize cooling efficiency in measurement-based quantum cooling.
Findings
Average population reduced by four orders of magnitude after 16 rounds
Success probability of about 30%
Optimal measurement interval inversely proportional to Rabi frequency
Abstract
Conditional cooling-by-measurement holds a significant advantage over its unconditional (nonselective) counterpart in the average-population-reduction rate. However, it has a clear weakness with respect to the limited success probability of finding the detector in the measured state. In this work, we propose an optimized architecture to cool down a target resonator, which is initialized as a thermal state, using an interpolation of conditional and unconditional measurement strategies. An optimal measurement-interval for unconditional measurement is analytically derived for the first time, which is inversely proportional to the collective dominant Rabi frequency as a function of the resonator's population in the end of the last round. A cooling algorithm under global optimization by the reinforcement learning results in the maximum value for the cooperative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
