Social Media Fashion Knowledge Extraction as Captioning

Yifei Yuan; Wenxuan Zhang; Yang Deng; and Wai Lam

arXiv:2309.16270·cs.CL·September 29, 2023·1 cites

Social Media Fashion Knowledge Extraction as Captioning

Yifei Yuan, Wenxuan Zhang, Yang Deng, and Wai Lam

PDF

Open Access

TL;DR

This paper introduces a novel approach to extract fashion knowledge from social media posts by transforming the task into caption generation, leveraging multimodal pre-trained models and a new annotated dataset.

Contribution

The work reformulates fashion knowledge extraction as a captioning task using multimodal pre-trained models and creates a new dataset with manual annotations for social media posts.

Findings

01

The proposed model effectively generates fashion knowledge captions from social media posts.

02

The multimodal pre-trained model outperforms traditional classification-based methods.

03

Auxiliary tasks enhance the accuracy of knowledge extraction.

Abstract

Social media plays a significant role in boosting the fashion industry, where a massive amount of fashion-related posts are generated every day. In order to obtain the rich fashion information from the posts, we study the task of social media fashion knowledge extraction. Fashion knowledge, which typically consists of the occasion, person attributes, and fashion item information, can be effectively represented as a set of tuples. Most previous studies on fashion knowledge extraction are based on the fashion product images without considering the rich text information in social media posts. Existing work on fashion knowledge extraction in social media is classification-based and requires to manually determine a set of fashion knowledge categories in advance. In our work, we propose to cast the task as a captioning problem to capture the interplay of the multimodal post information.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media and Visual Art