FAIRS -- Soft Focus Generator and Attention for Robust Object Segmentation from Extreme Points
Ahmed H. Shahin, Prateek Munjal, Ling Shao, Shadab Khan

TL;DR
FAIRS introduces a novel method for interactive object segmentation using extreme points and corrective clicks, employing a dual attention mechanism to improve accuracy and scalability across large datasets.
Contribution
It proposes an effective encoding of user inputs and a dual attention module, enabling the model to handle variable clicks and refine outputs for robust segmentation.
Findings
Significant improvement over state-of-the-art in dense object segmentation
Effective handling of variable number of user clicks including corrective clicks
High-quality training data generation demonstrated
Abstract
Semantic segmentation from user inputs has been actively studied to facilitate interactive segmentation for data annotation and other applications. Recent studies have shown that extreme points can be effectively used to encode user inputs. A heat map generated from the extreme points can be appended to the RGB image and input to the model for training. In this study, we present FAIRS -- a new approach to generate object segmentation from user inputs in the form of extreme points and corrective clicks. We propose a novel approach for effectively encoding the user input from extreme points and corrective clicks, in a novel and scalable manner that allows the network to work with a variable number of clicks, including corrective clicks for output refinement. We also integrate a dual attention module with our approach to increase the efficacy of the model in preferentially attending to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Image and Object Detection Techniques
