Spec Kit Agents: Context-Grounded Agentic Workflows

Pardis Taghavi; Santosh Bhavani

arXiv:2604.05278·cs.SE·April 8, 2026

Spec Kit Agents: Context-Grounded Agentic Workflows

Pardis Taghavi, Santosh Bhavani

PDF

1 Repo

TL;DR

Spec Kit Agents enhance AI coding workflows by integrating context-grounding hooks that improve code quality and repository compatibility in large, evolving codebases.

Contribution

The paper introduces Spec Kit Agents, a multi-agent SDD pipeline with phase-level context-grounding and validation hooks, improving code quality and repository adherence.

Findings

01

Context-grounding hooks increase judged quality by +0.15 on a 1-5 scale.

02

The framework maintains 99.7-100% repository-level test compatibility.

03

Augmentation hooks improve baseline performance by 1.7% on SWE-bench Lite.

Abstract

Spec-driven development (SDD) with AI coding agents provides a structured workflow, but agents often remain "context blind" in large, evolving repositories, leading to hallucinated APIs and architectural violations. We present Spec Kit Agents, a multi-agent SDD pipeline (with PM and developer roles) that adds phase-level, context-grounding hooks. Read-only probing hooks ground each stage (Specify, Plan, Tasks, Implement) in repository evidence, while validation hooks check intermediate artifacts against the environment. We evaluate 128 runs covering 32 features across five repositories. Context-grounding hooks improve judged quality by +0.15 on a 1-5 composite LLM-as-judge score (+3.0 percent of the full score; Wilcoxon signed-rank, p < 0.05) while maintaining 99.7-100 percent repository-level test compatibility. We further evaluate the framework on SWE-bench Lite, where augmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sbhavani/speckit-agents
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.