Memory Efficient Matting with Adaptive Token Routing
Yiheng Lin, Yihan Hu, Chenyi Zhang, Ting Liu, Xiaochao Qu, Luoqi Liu,, Yao Zhao, Yunchao Wei

TL;DR
MEMatte is a novel memory-efficient image matting framework that uses adaptive token routing to process high-resolution images effectively, reducing memory and computational costs while maintaining high accuracy.
Contribution
The paper introduces MEMatte, which employs a dynamic routing mechanism and a lightweight token refinement module to enable high-resolution image matting with significantly reduced memory and latency.
Findings
Reduces memory usage by approximately 88% on high-resolution images.
Cuts latency by about 50% compared to existing methods.
Outperforms current state-of-the-art in high-resolution and real-world datasets.
Abstract
Transformer-based models have recently achieved outstanding performance in image matting. However, their application to high-resolution images remains challenging due to the quadratic complexity of global self-attention. To address this issue, we propose MEMatte, a \textbf{m}emory-\textbf{e}fficient \textbf{m}atting framework for processing high-resolution images. MEMatte incorporates a router before each global attention block, directing informative tokens to the global attention while routing other tokens to a Lightweight Token Refinement Module (LTRM). Specifically, the router employs a local-global strategy to predict the routing probability of each token, and the LTRM utilizes efficient modules to simulate global attention. Additionally, we introduce a Batch-constrained Adaptive Token Routing (BATR) mechanism, which allows each router to dynamically route tokens based on image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Image and Video Retrieval Techniques · Oral and Maxillofacial Pathology
MethodsSoftmax · Attention Is All You Need
