Local Manifold Augmentation for Multiview Semantic Consistency
Yu Yang, Wing Yin Cheung, Chang Liu, Xiangyang Ji

TL;DR
This paper introduces local manifold augmentation (LMA), a novel data augmentation technique that generates diverse, semantically consistent views by modeling data variations, significantly improving self-supervised learning performance and robustness across multiple datasets.
Contribution
The paper proposes LMA, an instance-conditioned generator that captures local data variations to produce diverse augmentations, enhancing multiview semantic consistency in self-supervised learning.
Findings
LMA improves accuracy on CIFAR, STL10, and ImageNet benchmarks.
LMA enhances invariance to viewpoint, pose, and illumination.
LMA increases robustness to distribution shifts like ImageNet-V2 and Sketch.
Abstract
Multiview self-supervised representation learning roots in exploring semantic consistency across data of complex intra-class variation. Such variation is not directly accessible and therefore simulated by data augmentations. However, commonly adopted augmentations are handcrafted and limited to simple geometrical and color changes, which are unable to cover the abundant intra-class variation. In this paper, we propose to extract the underlying data variation from datasets and construct a novel augmentation operator, named local manifold augmentation (LMA). LMA is achieved by training an instance-conditioned generator to fit the distribution on the local manifold of data and sampling multiview data using it. LMA shows the ability to create an infinite number of data views, preserve semantics, and simulate complicated variations in object pose, viewpoint, lighting condition, background…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
