The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents

Rafflesia Khan; Nafiul Islam Khan

arXiv:2604.13759·cs.AI·April 16, 2026

The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents

Rafflesia Khan, Nafiul Islam Khan

PDF

TL;DR

This paper presents the Cognitive Companion, a lightweight parallel monitoring system for LLM agents that reduces reasoning errors with minimal overhead, especially effective on loop-prone and open-ended tasks.

Contribution

It introduces a novel zero-overhead probe-based monitoring architecture and demonstrates its feasibility and task-dependent effectiveness through empirical studies.

Findings

01

LLM-based Companion reduced repetition by 52-62% on loop-prone tasks.

02

Probe-based Companion achieved AUROC 0.840 on a small dataset.

03

Companion benefits are most significant on loop-prone and open-ended tasks.

Abstract

Large language model (LLM) agents on multi-step tasks suffer reasoning degradation, looping, drift, stuck states, at rates up to 30% on hard tasks. Current solutions include hard step limits (abrupt) or LLM-as-judge monitoring (10-15% overhead per step). This paper introduces the Cognitive Companion, a parallel monitoring architecture with two implementations: an LLM-based Companion and a novel zero-overhead Probe-based Companion. We report a three-batch feasibility study centered on Gemma 4 E4B, with an additional exploratory small-model analysis on Qwen 2.5 1.5B and Llama 3.2 1B. In our experiments, the LLM-based Companion reduced repetition on loop-prone tasks by 52-62% with approximately 11% overhead. The Probe-based Companion, trained on hidden states from layer 28, showed a mean effect size of +0.471 at zero measured inference overhead; its strongest probe result achieved…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.