Single photon in hierarchical architecture for physical reinforcement learning: Photon intelligence
Makoto Naruse, Martin Berthel, Aur\'elien Drezet, Serge Huant,, Hirokazu Hori, Song-Ju Kim

TL;DR
This paper demonstrates a hierarchical single-photon architecture capable of solving complex reinforcement learning problems, showcasing photon-based decision making and exploration abilities for future AI applications.
Contribution
It introduces a scalable hierarchical architecture for photon-based reinforcement learning, extending previous work to more complex multi-armed bandit problems.
Findings
Successfully solved four-armed bandit problem with zero prior knowledge
Hierarchical structure reveals layer-dependent decision conflicts
Probabilistic photon behavior enables direct optimal solution localization
Abstract
Understanding and using natural processes for intelligent functionalities, referred to as natural intelligence, has recently attracted interest from a variety of fields, including post-silicon computing for artificial intelligence and decision making in the behavioural sciences. In a past study, we successfully used the wave-particle duality of single photons to solve the two-armed bandit problem, which constitutes the foundation of reinforcement learning and decision making. In this study, we propose and confirm a hierarchical architecture for single-photon-based reinforcement learning and decision making that verifies the scalability of the principle. Specifically, the four-armed bandit problem is solved given zero prior knowledge in a two-layer hierarchical architecture, where polarization is autonomously adapted in order to effect adequate decision making using single-photon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Quantum Information and Cryptography · Random lasers and scattering media
