Multimodal Prediction and Personalization of Photo Edits with Deep   Generative Models

Ardavan Saeedi; Matthew D. Hoffman; Stephen J. DiVerdi; Asma; Ghandeharioun; Matthew J. Johnson; Ryan P. Adams

arXiv:1704.04997·stat.ML·April 18, 2017·5 cites

Multimodal Prediction and Personalization of Photo Edits with Deep Generative Models

Ardavan Saeedi, Matthew D. Hoffman, Stephen J. DiVerdi, Asma, Ghandeharioun, Matthew J. Johnson, Ryan P. Adams

PDF

Open Access

TL;DR

This paper introduces a deep generative model for personalized photo editing suggestions that predicts diverse, high-quality edits and adapts to individual aesthetic preferences, improving over existing methods.

Contribution

The work presents a novel hierarchical neural network model that captures diverse editing styles and personalizes suggestions based on user preferences in photo editing.

Findings

01

Model outperforms existing approaches on multimodal prediction tasks.

02

Generates diverse and high-quality photo edit suggestions.

03

Learns and adapts to individual user aesthetic preferences.

Abstract

Professional-grade software applications are powerful but complicated $-$ expert users can achieve impressive results, but novices often struggle to complete even basic tasks. Photo editing is a prime example: after loading a photo, the user is confronted with an array of cryptic sliders like "clarity", "temp", and "highlights". An automatically generated suggestion could help, but there is no single "correct" edit for a given image $-$ different experts may make very different aesthetic decisions when faced with the same image, and a single expert may make different choices depending on the intended use of the image (or on a whim). We therefore want a system that can propose multiple diverse, high-quality edits while also learning from and adapting to a user's aesthetic preferences. In this work, we develop a statistical model that meets these objectives. Our model builds on recent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis