# Effect of the reinforcement rate on goal-directed and habitual choices in a multiple schedule

**Authors:** Ting Hu, Shun Fujimaki, Hiroto Kawarada, Yutaka Kosaki

PMC · DOI: 10.3389/fpsyg.2025.1601901 · Frontiers in Psychology · 2025-07-02

## TL;DR

This study explores how different reinforcement rates affect whether rats make goal-directed or habitual choices in a complex task.

## Contribution

The study reveals that lean reinforcement conditions maintain goal-directed behavior even after extensive training, unlike rich conditions.

## Key findings

- Lean reinforcement groups showed reduced performance after outcome devaluation, indicating goal-directed behavior.
- Rich reinforcement groups exhibited habitual behavior, with no significant change after outcome devaluation.
- Alternating reinforcement contingencies under lean conditions consistently sustained goal-directed control.

## Abstract

Voluntary behaviors can be either goal-directed, sensitive to changes in their consequences, or habitual, lacking such sensitivity. In this study, we conducted three experiments to investigate how forced-choice training influences goal-directed and habitual processes under varying reinforcement rates. In all experiments, rats received 15 training sessions on a two-component multiple schedule with two sequentially inserted levers. In Experiment 1, identical variable interval (VI) 15-s schedules were used across components for Group Rich and VI 90-s schedules for Group Lean, yielding different behavioral outcomes. Following taste aversion for one outcome, Group Lean reduced performance (i.e., goal-directed action) during an extinction test, while Group Rich did not (i.e., habit). Experiment 2 addressed differential outcome exposure by reversing training conditions: Group Rich received numerous outcomes equivalent to Group Lean in Experiment 1, and vice versa. The devaluation effects were evident in both groups. Using the same outcome across components, Experiment 3 trained rats on a multiple VI 15-s VI 90-s schedule to further clarify the role of response–outcome pairings while controlling for the total amount of outcome exposure. Although the VI 15-s component produced fewer outcomes, it led to stronger devaluation effects and residual responding. The most important finding of this study is that alternating R–O contingencies in a multiple schedule under lean reinforcement conditions consistently sustain goal-directed control even after extensive training, while richer conditions promote a shift to habitual control. These findings are discussed within a dual-system model framework in a molar context, hypothesizing that both goal-directed and habitual strength may grow more rapidly with higher reinforcement rates.

## Linked entities

- **Species:** Rattus norvegicus (taxon 10116)

## Full-text entities

- **Diseases:** taste aversion (MESH:D020018)
- **Species:** Rattus norvegicus (brown rat, species) [taxon 10116]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12263642/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12263642/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12263642/full.md

---
Source: https://tomesphere.com/paper/PMC12263642