From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents
Tobias Lindenbauer, Georg Groh, Hinrich Sch\"utze

TL;DR
This paper presents CTIM-Rover, an AI agent for Software Engineering that incorporates episodic memory to improve long-term understanding, but finds that noise in memory can hinder performance compared to existing methods.
Contribution
Introduces CTIM-Rover with repository-level episodic memory for SE agents, highlighting challenges of noise and scalability in real-world applications.
Findings
CTIM-Rover does not outperform AutoCodeRover in tested configurations.
Noise from distracting memory items affects performance negatively.
Long-term memory integration remains challenging for practical SE agents.
Abstract
We introduce CTIM-Rover, an AI agent for Software Engineering (SE) built on top of AutoCodeRover (Zhang et al., 2024) that extends agentic reasoning frameworks with an episodic memory, more specifically, a general and repository-level Cross-Task-Instance Memory (CTIM). While existing open-source SE agents mostly rely on ReAct (Yao et al., 2023b), Reflexion (Shinn et al., 2023), or Code-Act (Wang et al., 2024), all of these reasoning and planning frameworks inefficiently discard their long-term memory after a single task instance. As repository-level understanding is pivotal for identifying all locations requiring a patch for fixing a bug, we hypothesize that SE is particularly well positioned to benefit from CTIM. For this, we build on the Experiential Learning (EL) approach ExpeL (Zhao et al., 2024), proposing a Mixture-Of-Experts (MoEs) inspired approach to create both a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAI-based Problem Solving and Planning · Advanced Software Engineering Methodologies · Multi-Agent Systems and Negotiation
