Social Media Fashion Knowledge Extraction as Captioning
Yifei Yuan, Wenxuan Zhang, Yang Deng, and Wai Lam

TL;DR
This paper introduces a novel approach to extract fashion knowledge from social media posts by transforming the task into caption generation, leveraging multimodal pre-trained models and a new annotated dataset.
Contribution
The work reformulates fashion knowledge extraction as a captioning task using multimodal pre-trained models and creates a new dataset with manual annotations for social media posts.
Findings
The proposed model effectively generates fashion knowledge captions from social media posts.
The multimodal pre-trained model outperforms traditional classification-based methods.
Auxiliary tasks enhance the accuracy of knowledge extraction.
Abstract
Social media plays a significant role in boosting the fashion industry, where a massive amount of fashion-related posts are generated every day. In order to obtain the rich fashion information from the posts, we study the task of social media fashion knowledge extraction. Fashion knowledge, which typically consists of the occasion, person attributes, and fashion item information, can be effectively represented as a set of tuples. Most previous studies on fashion knowledge extraction are based on the fashion product images without considering the rich text information in social media posts. Existing work on fashion knowledge extraction in social media is classification-based and requires to manually determine a set of fashion knowledge categories in advance. In our work, we propose to cast the task as a captioning problem to capture the interplay of the multimodal post information.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media and Visual Art
