From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents

Tobias Lindenbauer; Georg Groh; Hinrich Sch\"utze

arXiv:2505.23422·cs.SE·May 30, 2025

From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents

Tobias Lindenbauer, Georg Groh, Hinrich Sch\"utze

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents CTIM-Rover, an AI agent for Software Engineering that incorporates episodic memory to improve long-term understanding, but finds that noise in memory can hinder performance compared to existing methods.

Contribution

Introduces CTIM-Rover with repository-level episodic memory for SE agents, highlighting challenges of noise and scalability in real-world applications.

Findings

01

CTIM-Rover does not outperform AutoCodeRover in tested configurations.

02

Noise from distracting memory items affects performance negatively.

03

Long-term memory integration remains challenging for practical SE agents.

Abstract

We introduce CTIM-Rover, an AI agent for Software Engineering (SE) built on top of AutoCodeRover (Zhang et al., 2024) that extends agentic reasoning frameworks with an episodic memory, more specifically, a general and repository-level Cross-Task-Instance Memory (CTIM). While existing open-source SE agents mostly rely on ReAct (Yao et al., 2023b), Reflexion (Shinn et al., 2023), or Code-Act (Wang et al., 2024), all of these reasoning and planning frameworks inefficiently discard their long-term memory after a single task instance. As repository-level understanding is pivotal for identifying all locations requiring a patch for fixing a bug, we hypothesize that SE is particularly well positioned to benefit from CTIM. For this, we build on the Experiential Learning (EL) approach ExpeL (Zhao et al., 2024), proposing a Mixture-Of-Experts (MoEs) inspired approach to create both a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liqs-v2/ctim-rover
noneOfficial

Videos

From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents· underline

Taxonomy

TopicsAI-based Problem Solving and Planning · Advanced Software Engineering Methodologies · Multi-Agent Systems and Negotiation