Imitation Learning for Fashion Style Based on Hierarchical Multimodal Representation
Shizhu Liu, Shanglin Yang, and Hui Zhou

TL;DR
This paper introduces a hierarchical multimodal adversarial inverse reinforcement learning approach to effectively imitate complex fashion styles from demonstrations, addressing challenges of distribution shift and high-dimensional observations.
Contribution
It proposes HM-AIRL, a novel hierarchical multimodal inverse reinforcement learning framework that improves robustness and accuracy in fashion style imitation tasks.
Findings
HM-AIRL accurately recovers reward functions from multimodal fashion data.
The model demonstrates robustness to variations in style and observations.
It outperforms existing supervised imitation methods in style consistency.
Abstract
Fashion is a complex social phenomenon. People follow fashion styles from demonstrations by experts or fashion icons. However, for machine agent, learning to imitate fashion experts from demonstrations can be challenging, especially for complex styles in environments with high-dimensional, multimodal observations. Most existing research regarding fashion outfit composition utilizes supervised learning methods to mimic the behaviors of style icons. These methods suffer from distribution shift: because the agent greedily imitates some given outfit demonstrations, it can drift away from one style to another styles given subtle differences. In this work, we propose an adversarial inverse reinforcement learning formulation to recover reward functions based on hierarchical multimodal representation (HM-AIRL) during the imitation process. The hierarchical joint representation can more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Human Motion and Animation
