Learning to Color from Language

Varun Manjunatha; Mohit Iyyer; Jordan Boyd-Graber; Larry; Davis

arXiv:1804.06026·cs.CV·April 18, 2018

Learning to Color from Language

Varun Manjunatha, Mohit Iyyer, Jordan Boyd-Graber, Larry, Davis

PDF

1 Repo

TL;DR

This paper introduces a language-conditioned image colorization method that allows users to control and manipulate the coloring of greyscale images through descriptive captions, improving accuracy and plausibility.

Contribution

It proposes two novel architectures for language-conditioned colorization, enabling more precise and user-controllable colorization of images based on natural language input.

Findings

01

Language-conditioned models outperform language-agnostic ones in colorization accuracy.

02

Manipulating captions effectively changes the colorization results.

03

The approach allows intuitive user control over image coloring.

Abstract

Automatic colorization is the process of adding color to greyscale images. We condition this process on language, allowing end users to manipulate a colorized image by feeding in different captions. We present two different architectures for language-conditioned colorization, both of which produce more accurate and plausible colorizations than a language-agnostic version. Through this language-based framework, we can dramatically alter colorizations by manipulating descriptive color words in captions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

superhans/colorfromlanguage
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsColorization