# Circuit mechanisms of GPe pauses account for adaptive exploration

**Authors:** Sang Wan Lee, Minryung Song, Shinwoo Kang, Minsu Abel Yang, Doo-Sup Choi

PMC · DOI: 10.21203/rs.3.rs-7117998/v1 · Research Square · 2025-08-05

## TL;DR

This paper explores how pauses in the globus pallidus (GPe) help the brain adapt during learning, using a computational model and real brain data.

## Contribution

The study introduces a neurophysiologically grounded model linking GPe pauses to reinforcement learning through a GPe-STN circuit acting like a denoising autoencoder.

## Key findings

- GPe pauses emerge from strong inhibition from GPe to STN, modulating downstream brain circuits.
- GPe-STN activity increases after environmental changes, promoting adaptive exploration.
- Extended training weakens GPe-STN projections, favoring habitual behavior over exploration.

## Abstract

The external globus pallidus (GPe) has traditionally been viewed as a relay nucleus within the basal ganglia (BG), but accumulating evidence indicates a more dynamic role in reinforcement learning (RL). One key characteristic of GPe activity—transient pauses in high-frequency discharge (HFD) neurons—is preserved across species, yet its potential implications in RL remains unclear. Here, we developed a neurophysiologically grounded computational model to investigate the origin and role of GPe pauses in RL. Our model successfully replicated a range of empirical observations, including pause dynamics during learning and cue-related activity modulation. We demonstrated that the GPe-subthalamic nucleus (STN) circuit functions analogously to a denoising autoencoder, modulating baseline excitability in downstream BG circuits and that GPe pauses emerge as circuit-level consequences of strong, convergent inhibition from the GPe to STN. Simulations and in vivo recordings revealed that the activity of GPe-STN projecting neurons increases following sudden environmental changes, promoting adaptive exploration by disrupting action value contrast. Intriguingly, this same configuration impairs performance with extended training, suggesting that habitual behavior may benefit from weakened GPe-to-STN projections. These findings provide a unifying framework for understanding GPe pause dynamics and highlight circuit-level distinctions supporting the balance between flexibility and proficiency in RL.

## Full-text entities

- **Genes:** crf (cream fur) [NCBI Gene 12917], Nes (nestin) [NCBI Gene 18008] {aka ESTM46, Ifaprc2, Marc2, RC2}, Gpi1 (glucose-6-phosphate isomerase 1) [NCBI Gene 14751] {aka Amf, Gpi, Gpi-1, Gpi-1r, Gpi-1s, Gpi-1t}, Crhr1 (corticotropin releasing hormone receptor 1) [NCBI Gene 12921] {aka CRF-R1alpha, CRF1R, CRFR1, Crhr}, Crh (corticotropin releasing hormone) [NCBI Gene 12918] {aka CRF, Gm1347}, Gfap (glial fibrillary acidic protein) [NCBI Gene 14580]
- **Diseases:** HFD (MESH:D006316), pupil dilation (MESH:D011681), nose-pokes (MESH:D009668)
- **Chemicals:** DAPI (MESH:C007293), paraformaldehyde (MESH:C003043), GPe (MESH:C062053), oxygen (MESH:D010100), Ca2+ (-), isoflurane (MESH:D007530), sucrose (MESH:D013395), Calcium (MESH:D002118)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Cercopithecidae (monkey, family) [taxon 9527], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** DAE-B — Homo sapiens (Human), Telomerase immortalized cell line (CVCL_T028)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12340893/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12340893/full.md

## References

64 references — full list in the complete paper: https://tomesphere.com/paper/PMC12340893/full.md

---
Source: https://tomesphere.com/paper/PMC12340893