SEM-Net: Efficient Pixel Modelling for image inpainting with Spatially Enhanced SSM
Shuang Chen, Haozheng Zhang, Amir Atapour-Abarghouei, Hubert P. H., Shum

TL;DR
SEM-Net is a novel pixel-level inpainting model that efficiently captures long-range dependencies using a state space approach, outperforming existing methods in spatial consistency and generalizing well to other tasks like motion deblurring.
Contribution
Introduces SEM-Net, a state space model with innovative spatially-enhanced modules for improved long-range dependency modeling in image inpainting.
Findings
Outperforms state-of-the-art inpainting methods on two datasets.
Achieves significant improvements in spatial consistency.
Demonstrates strong generalization to motion deblurring.
Abstract
Image inpainting aims to repair a partially damaged image based on the information from known regions of the images. \revise{Achieving semantically plausible inpainting results is particularly challenging because it requires the reconstructed regions to exhibit similar patterns to the semanticly consistent regions}. This requires a model with a strong capacity to capture long-range dependencies. Existing models struggle in this regard due to the slow growth of receptive field for Convolutional Neural Networks (CNNs) based methods and patch-level interactions in Transformer-based methods, which are ineffective for capturing long-range dependencies. Motivated by this, we propose SEM-Net, a novel visual State Space model (SSM) vision network, modelling corrupted images at the pixel level while capturing long-range dependencies (LRDs) in state space, achieving a linear computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Generative Adversarial Networks and Image Synthesis · Industrial Vision Systems and Defect Detection
MethodsDense Connections · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Inpainting · Feedforward Network
