HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
Wonjae Kim, Sanghyuk Chun, Taekyung Kim, Dongyoon Han and, Sangdoo Yun

TL;DR
HYPE introduces hyperbolic entailment filtering to improve data quality in image-text datasets, leading to better self-supervised learning by selecting more meaningful and specific samples, and achieving state-of-the-art results.
Contribution
The paper presents a novel hyperbolic embedding-based filtering method that enhances data specificity and alignment in large-scale datasets for self-supervised learning.
Findings
HYPE significantly improves filtering efficiency.
Sets new state-of-the-art on DataComp benchmark.
Image specificity measure outperforms CLIP score for dataset induction.
Abstract
In an era where the volume of data drives the effectiveness of self-supervised learning, the specificity and clarity of data semantics play a crucial role in model training. Addressing this, we introduce HYPerbolic Entailment filtering (HYPE), a novel methodology designed to meticulously extract modality-wise meaningful and well-aligned data from extensive, noisy image-text pair datasets. Our approach leverages hyperbolic embeddings and the concept of entailment cones to evaluate and filter out samples with meaningless or underspecified semantics, focusing on enhancing the specificity of each data sample. HYPE not only demonstrates a significant improvement in filtering efficiency but also sets a new state-of-the-art in the DataComp benchmark when combined with existing filtering techniques. This breakthrough showcases the potential of HYPE to refine the data selection process, thereby…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · Digital Media Forensic Detection
MethodsAttention Is All You Need · Softmax · RAdam · Graph Self-Attention · Hyperboloid Embeddings · Contrastive Language-Image Pre-training
