A degree of image identification at sub-human scales could be possible   with more advanced clusters

Prateek Y J

arXiv:2308.05092·cs.CV·August 10, 2023

A degree of image identification at sub-human scales could be possible with more advanced clusters

Prateek Y J

PDF

Open Access

TL;DR

This research investigates whether self-supervised learning with increased data volume and image quality can achieve human-level visual comprehension at sub-human scales, demonstrating promising results with vision transformers.

Contribution

It introduces a scaling experiment that combines data volume and image resolution increases to reach human-level detection performance without external funding.

Findings

01

Scaling data and image quality improves detection accuracy.

02

Vision transformers achieve human-like performance at small scales.

03

Sub-human size detection becomes feasible with advanced clustering.

Abstract

The purpose of the research is to determine if currently available self-supervised learning techniques can accomplish human level comprehension of visual images using the same degree and amount of sensory input that people acquire from. Initial research on this topic solely considered data volume scaling. Here, we scale both the volume of data and the quality of the image. This scaling experiment is a self-supervised learning method that may be done without any outside financing. We find that scaling up data volume and picture resolution at the same time enables human-level item detection performance at sub-human sizes.We run a scaling experiment with vision transformers trained on up to 200000 images up to 256 ppi.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification