Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs

Jitesh Chavan; Rohit Lal; Anand Kamat; Mengjia Xu

arXiv:2511.11243·cs.CV·November 18, 2025

Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs

Jitesh Chavan, Rohit Lal, Anand Kamat, Mengjia Xu

PDF

Open Access

TL;DR

Arcee introduces a differentiable recurrent state chain for vision modeling with Mamba SSMs, enabling cross-block memory reuse and significantly improving unconditional image generation quality.

Contribution

It proposes a novel cross-block recurrent state chain that reuses terminal state representations, compatible with existing Mamba variants, and enhances generative performance.

Findings

01

Reduces FID from 82.81 to 15.33 on CelebA-HQ

02

Compatible with all prior vision-mamba variants

03

Provides constant, negligible computational overhead

Abstract

State-space models (SSMs), Mamba in particular, are increasingly adopted for long-context sequence modeling, providing linear-time aggregation via an input-dependent, causal selective-scan operation. Along this line, recent "Mamba-for-vision" variants largely explore multiple scan orders to relax strict causality for non-sequential signals (e.g., images). Rather than preserving cross-block memory, the conventional formulation of the selective-scan operation in Mamba reinitializes each block's state-space dynamics from zero, discarding the terminal state-space representation (SSR) from the previous block. Arcee, a cross-block recurrent state chain, reuses each block's terminal state-space representation as the initial condition for the next block. Handoff across blocks is constructed as a differentiable boundary map whose Jacobian enables end-to-end gradient flow across terminal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Control Systems and Identification