Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels
Bang Yang, Fenglin Liu, Zheng Li, Qingyu Yin, Chenyu You, Bing Yin,, and Yuexian Zou

TL;DR
This paper introduces a multimodal prompt learning framework that effectively generates product titles for novel items with minimal labeled data, outperforming traditional methods especially in few-shot scenarios.
Contribution
It proposes a novel prompt-based approach leveraging multimodal prompts to generate product titles with extremely limited labels, addressing challenges of understanding product features and style.
Findings
Achieves state-of-the-art results with full labeled data.
Outperforms existing methods in few-shot settings with only 1% labeled data.
Effective in both in-domain and out-of-domain scenarios.
Abstract
Generating an informative and attractive title for the product is a crucial task for e-commerce. Most existing works follow the standard multimodal natural language generation approaches, e.g., image captioning, and employ the large scale of human-labelled datasets to train desirable models. However, for novel products, especially in a different domain, there are few existing labelled data. In this paper, we propose a prompt-based approach, i.e., the Multimodal Prompt Learning framework, to accurately and efficiently generate titles for novel products with limited labels. We observe that the core challenges of novel product title generation are the understanding of novel product characteristics and the generation of titles in a novel writing style. To this end, we build a set of multimodal prompts from different modalities to preserve the corresponding characteristics and writing styles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
