Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings
Keno Moenck, Duc Trung Thieu, Julian Koch, Thorsten Sch\"uppstuhl

TL;DR
This paper introduces ILID, a large-scale industrial language-image dataset, and demonstrates how self-supervised transfer learning on this dataset enhances vision foundation models for industrial applications without requiring manual labeling.
Contribution
The work presents a novel pipeline for creating an industrial dataset from web data and shows effective transfer learning techniques for adapting vision models to industrial settings.
Findings
Effective self-supervised transfer learning improves industrial vision tasks.
ILID enables domain-specific adaptation without manual annotations.
Models trained on ILID outperform baseline approaches in industrial scenarios.
Abstract
In recent years, the upstream of Large Language Models (LLM) has also encouraged the computer vision community to work on substantial multimodal datasets and train models on a scale in a self-/semi-supervised manner, resulting in Vision Foundation Models (VFM), as, e.g., Contrastive Language-Image Pre-training (CLIP). The models generalize well and perform outstandingly on everyday objects or scenes, even on downstream tasks, tasks the model has not been trained on, while the application in specialized domains, as in an industrial context, is still an open research question. Here, fine-tuning the models or transfer learning on domain-specific data is unavoidable when objecting to adequate performance. In this work, we, on the one hand, introduce a pipeline to generate the Industrial Language-Image Dataset (ILID) based on web-crawled data; on the other hand, we demonstrate effective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Processing Techniques
