Mamba-ST: State Space Model for Efficient Style Transfer
Filippo Botti, Alex Ergasti, Leonardo Rossi, Tomaso Fontanini, Claudio, Ferrari, Massimo Bertozzi, Andrea Prati

TL;DR
Mamba-ST introduces a novel state-space model for style transfer that significantly reduces computational resources while maintaining or improving output quality compared to transformer and diffusion-based methods.
Contribution
This paper adapts Mamba's state-space equations for style transfer, eliminating the need for attention or normalization modules, and demonstrates superior efficiency and quality.
Findings
Outperforms transformer and diffusion models in style transfer quality.
Reduces memory and inference time significantly.
Achieves higher ArtFID and FID scores.
Abstract
The goal of style transfer is, given a content image and a style source, generating a new image preserving the content but with the artistic representation of the style source. Most of the state-of-the-art architectures use transformers or diffusion-based models to perform this task, despite the heavy computational burden that they require. In particular, transformers use self- and cross-attention layers which have large memory footprint, while diffusion models require high inference time. To overcome the above, this paper explores a novel design of Mamba, an emergent State-Space Model (SSM), called Mamba-ST, to perform style transfer. To do so, we adapt Mamba linear equation to simulate the behavior of cross-attention layers, which are able to combine two separate embeddings into a single output, but drastically reducing memory usage and time complexity. We modified the Mamba's inner…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling
MethodsSparse Evolutionary Training · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Diffusion
