Visual Understanding via Multi-Feature Shared Learning with Global Consistency
Lei Zhang, David Zhang

TL;DR
This paper introduces a multi-feature shared learning framework called GLCC that leverages manifold structure and global label consistency to improve visual recognition accuracy across various multimedia datasets.
Contribution
The paper proposes a novel l_2-norm based multi-feature shared learning method with a group graph regularizer, enhancing semi-supervised visual understanding.
Findings
Outperforms state-of-the-art algorithms on benchmark datasets.
Effectively exploits manifold structure of features.
Demonstrates robustness with deep convolutional features.
Abstract
Image/video data is usually represented with multiple visual features. Fusion of multi-source information for establishing the attributes has been widely recognized. Multi-feature visual recognition has recently received much attention in multimedia applications. This paper studies visual understanding via a newly proposed l_2-norm based multi-feature shared learning framework, which can simultaneously learn a global label matrix and multiple sub-classifiers with the labeled multi-feature data. Additionally, a group graph manifold regularizer composed of the Laplacian and Hessian graph is proposed for better preserving the manifold structure of each feature, such that the label prediction power is much improved through the semi-supervised learning with global label consistency. For convenience, we call the proposed approach Global-Label-Consistent Classifier (GLCC). The merits of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
