Imitation Learning for Fashion Style Based on Hierarchical Multimodal Representation

Shizhu Liu, Shanglin Yang, and Hui Zhou

arXiv:2004.06229 · cs.LG · April 15, 2020 · 1 citation

TL;DR

This paper introduces a hierarchical multimodal adversarial inverse reinforcement learning approach to effectively imitate complex fashion styles from demonstrations, addressing challenges of distribution shift and high-dimensional observations.

Contribution

The paper proposes HM-AIRL, a hierarchical multimodal adversarial inverse reinforcement learning framework that improves robustness and accuracy in fashion style imitation tasks.

Findings

01

HM-AIRL accurately recovers reward functions from multimodal fashion data.

02

The model demonstrates robustness to variations in style and observations.

03

HM-AIRL outperforms existing supervised imitation methods in style consistency.

Abstract

Fashion is a complex social phenomenon. People follow fashion styles from demonstrations by experts or fashion icons. However, for a machine agent, learning to imitate fashion experts from demonstrations can be challenging, especially for complex styles in environments with high-dimensional, multimodal observations. Most existing research on fashion outfit composition uses supervised learning methods to mimic the behaviors of style icons. These methods suffer from distribution shift: because the agent greedily imitates the given outfit demonstrations, subtle differences can cause it to drift from one style to another. In this work, we propose an adversarial inverse reinforcement learning formulation to recover reward functions based on hierarchical multimodal representation (HM-AIRL) during the imitation process. The hierarchical joint representation can more…
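
The abstract does not spell out the adversarial inverse reinforcement learning objective, so the following is only a minimal sketch of the generic AIRL discriminator and the reward recovered from it, not the paper's HM-AIRL model. It assumes a learned scalar score `f_value` for a state-action pair and the policy's log-probability `log_pi` for the same pair; both names, and the function names, are hypothetical. The hierarchical multimodal representation that would produce `f_value` from outfit images and text is not shown.

```python
import math


def airl_discriminator(f_value: float, log_pi: float) -> float:
    """Generic AIRL discriminator D = exp(f) / (exp(f) + pi).

    Algebraically this equals sigmoid(f - log_pi), which is computed
    here in a numerically stable way.
    """
    z = f_value - log_pi
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def airl_reward(f_value: float, log_pi: float) -> float:
    """Recovered reward log D - log(1 - D), which simplifies to f - log_pi."""
    return f_value - log_pi


def discriminator_loss(expert_batch, policy_batch) -> float:
    """Binary cross-entropy: expert pairs are labelled 1, policy pairs 0.

    Each batch is a list of (f_value, log_pi) tuples (hypothetical inputs).
    """
    expert_term = sum(
        math.log(airl_discriminator(f, lp)) for f, lp in expert_batch
    ) / len(expert_batch)
    policy_term = sum(
        math.log(1.0 - airl_discriminator(f, lp)) for f, lp in policy_batch
    ) / len(policy_batch)
    return -(expert_term + policy_term)


if __name__ == "__main__":
    # Hypothetical scores for one expert outfit choice and one policy choice.
    expert = [(2.0, math.log(0.3))]
    policy = [(-1.0, math.log(0.6))]
    print("D(expert):", airl_discriminator(*expert[0]))
    print("reward(expert):", airl_reward(*expert[0]))
    print("loss:", discriminator_loss(expert, policy))
```

In this generic setup the discriminator is trained to separate expert demonstrations from policy samples, and the policy is then updated to maximize the recovered reward. Because the reward is learned rather than the actions copied directly, the approach is less exposed to the distribution shift the abstract attributes to supervised imitation.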

Code & Models

Repositories

AemikaChow/DATASOURCE

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Human Motion and Animation