Towards Reinforcement Learning from Neural Feedback: Mapping fNIRS Signals to Agent Performance

Julia Santaniello; Matthew Russell; Benson Jiang; Donatello Sassaroli; Robert Jacob; Jivko Sinapov

arXiv:2511.12844·cs.AI·February 10, 2026

Towards Reinforcement Learning from Neural Feedback: Mapping fNIRS Signals to Agent Performance

Julia Santaniello, Matthew Russell, Benson Jiang, Donatello Sassaroli, Robert Jacob, Jivko Sinapov

PDF

Open Access 1 Video

TL;DR

This paper explores using neural signals from fNIRS to assess and guide reinforcement learning agents' performance, demonstrating feasibility and potential for neural feedback-based training systems.

Contribution

It introduces a novel dataset of fNIRS signals linked to agent performance and develops classifiers and regressors to predict performance levels from neural data.

Findings

01

Achieved 67% F1 score for binary classification of agent performance.

02

Demonstrated improved prediction accuracy with subject-specific fine-tuning.

03

Showed feasibility of using neural signals for reinforcement learning feedback.

Abstract

Reinforcement Learning from Human Feedback (RLHF) is a methodology that aligns agent behavior with human preferences by integrating user feedback into the agent's training process. This paper introduces a framework that guides agent training through implicit neural signals, with a focus on the neural classification problem. Our work presents and releases a novel dataset of functional near-infrared spectroscopy (fNIRS) recordings collected from 25 human participants across three domains: Pick-and-Place Robot, Lunar Lander, and Flappy Bird. We train multiple classifiers to predict varying levels of agent performance (optimal, suboptimal, or worst-case) from windows of preprocessed fNIRS features, achieving an average F1 score of 67% for binary and 46% for multi-class classification across conditions and domains. We also train multiple regressors to predict the degree of deviation between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards Reinforcement Learning from Neural Feedback: Mapping fNIRS Signals to Agent Performance· underline

Taxonomy

TopicsOptical Imaging and Spectroscopy Techniques · EEG and Brain-Computer Interfaces · Emotion and Mood Recognition