Semantic Image Cropping

Oriol Corcoll

arXiv:2107.07153·cs.CV·July 16, 2021·1 cites

Semantic Image Cropping

Oriol Corcoll

PDF

Open Access

TL;DR

This paper introduces semantic image cropping, a novel approach that enhances image relevance for specific entities by incorporating semantic information, supported by a new dataset and a deep learning system.

Contribution

It presents the concept of semantic image cropping, a new dataset with ground truth annotations, and a deep learning model that considers semantics for improved cropping.

Findings

01

State-of-the-art aesthetic cropping algorithms perform poorly on semantic cropping.

02

The proposed deep learning system outperforms existing methods in semantic relevance.

03

Semantic information improves the quality of image croppings for specific entities.

Abstract

Automatic image cropping techniques are commonly used to enhance the aesthetic quality of an image; they do it by detecting the most beautiful or the most salient parts of the image and removing the unwanted content to have a smaller image that is more visually pleasing. In this thesis, I introduce an additional dimension to the problem of cropping, semantics. I argue that image cropping can also enhance the image's relevancy for a given entity by using the semantic information contained in the image. I call this problem, Semantic Image Cropping. To support my argument, I provide a new dataset containing 100 images with at least two different entities per image and four ground truth croppings collected using Amazon Mechanical Turk. I use this dataset to show that state-of-the-art cropping algorithms that only take into account aesthetics do not perform well in the problem of semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Aesthetic Perception and Analysis