Learning A Low-Level Vision Generalist via Visual Task Prompt
Xiangyu Chen, Yihao Liu, Yuandong Pu, Wenlong Zhang, Jiantao Zhou, Yu, Qiao, Chao Dong

TL;DR
This paper introduces VPIP, a flexible framework using visual prompts to train a low-level vision generalist model, GenLV, capable of handling diverse tasks with improved performance over existing methods.
Contribution
The paper proposes a novel VPIP framework with prompt cross-attention, enabling a single model to effectively perform multiple low-level vision tasks.
Findings
GenLV outperforms existing methods quantitatively.
GenLV demonstrates strong qualitative results across tasks.
The framework effectively manages diverse input-target domains.
Abstract
Building a unified model for general low-level vision tasks holds significant research and practical value. Current methods encounter several critical issues. Multi-task restoration approaches can address multiple degradation-to-clean restoration tasks, while their applicability to tasks with different target domains (e.g., image stylization) is limited. Methods like PromptGIP can handle multiple input-target domains but rely on the Masked Autoencoder (MAE) paradigm. Consequently, they are tied to the ViT architecture, resulting in suboptimal image reconstruction quality. In addition, these methods are sensitive to prompt image content and often struggle with low-frequency information processing. In this paper, we propose a Visual task Prompt-based Image Processing (VPIP) framework to overcome these challenges. VPIP employs visual task prompts to manage tasks with different input-target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Vision and Imaging · Image Processing Techniques and Applications
