RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model
Hongjun Chen, Wencheng Han, Huan Zheng, Jianbing Shen

TL;DR
RAWMamba is a unified framework that effectively reconstructs RAW images from sRGB data across both image and video domains by harmonizing diverse metadata types and capturing long-range dependencies.
Contribution
It introduces a unified architecture with the UME module and LTA-Mamba for sRGB-to-RAW de-rendering, simplifying deployment and improving performance.
Findings
Achieves state-of-the-art RAW reconstruction quality
Effectively handles both image and video data
Outperforms existing specialized methods
Abstract
Recent advancements in sRGB-to-RAW de-rendering have increasingly emphasized metadata-driven approaches to reconstruct RAW data from sRGB images, supplemented by partial RAW information. In image-based de-rendering, metadata is commonly obtained through sampling, whereas in video tasks, it is typically derived from the initial frame. The distinct metadata requirements necessitate specialized network architectures, leading to architectural incompatibilities that increase deployment complexity. In this paper, we propose RAWMamba, a Mamba-based unified framework developed for sRGB-to-RAW de-rendering across both image and video domains. The core of RAWMamba is the Unified Metadata Embedding (UME) module, which harmonizes diverse metadata types into a unified representation. In detail, a multi-perspective affinity modeling method is proposed to promote the extraction of reference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · Video Coding and Compression Technologies
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces
