Visual Tracking via Dynamic Graph Learning
Chenglong Li, Liang Lin, Wangmeng Zuo, Jin Tang, Ming-Hsuan Yang

TL;DR
This paper introduces a dynamic graph learning approach for visual tracking, modeling objects with patch-based graphs that adaptively refine foreground-background separation to improve tracking accuracy.
Contribution
It proposes a novel patch-based graph learning framework with dynamic optimization for robust visual tracking, enhancing object localization by refining patch weights during tracking.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Effectively refines foreground-background separation during tracking.
Demonstrates robustness to background clutter and appearance changes.
Abstract
Existing visual tracking methods usually localize a target object with a bounding box, in which the performance of the foreground object trackers or detectors is often affected by the inclusion of background clutter. To handle this problem, we learn a patch-based graph representation for visual tracking. The tracked object is modeled by with a graph by taking a set of non-overlapping image patches as nodes, in which the weight of each node indicates how likely it belongs to the foreground and edges are weighted for indicating the appearance compatibility of two neighboring nodes. This graph is dynamically learned and applied in object tracking and model updating. During the tracking process, the proposed algorithm performs three main steps in each frame. First, the graph is initialized by assigning binary weights of some image patches to indicate the object and background patches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
