The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents
Rafflesia Khan, Nafiul Islam Khan

TL;DR
This paper presents the Cognitive Companion, a lightweight parallel monitoring system for LLM agents that reduces reasoning errors with minimal overhead, especially effective on loop-prone and open-ended tasks.
Contribution
It introduces a novel zero-overhead probe-based monitoring architecture and demonstrates its feasibility and task-dependent effectiveness through empirical studies.
Findings
LLM-based Companion reduced repetition by 52-62% on loop-prone tasks.
Probe-based Companion achieved AUROC 0.840 on a small dataset.
Companion benefits are most significant on loop-prone and open-ended tasks.
Abstract
Large language model (LLM) agents on multi-step tasks suffer reasoning degradation, looping, drift, stuck states, at rates up to 30% on hard tasks. Current solutions include hard step limits (abrupt) or LLM-as-judge monitoring (10-15% overhead per step). This paper introduces the Cognitive Companion, a parallel monitoring architecture with two implementations: an LLM-based Companion and a novel zero-overhead Probe-based Companion. We report a three-batch feasibility study centered on Gemma 4 E4B, with an additional exploratory small-model analysis on Qwen 2.5 1.5B and Llama 3.2 1B. In our experiments, the LLM-based Companion reduced repetition on loop-prone tasks by 52-62% with approximately 11% overhead. The Probe-based Companion, trained on hidden states from layer 28, showed a mean effect size of +0.471 at zero measured inference overhead; its strongest probe result achieved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
