Multimodal Quasi-AutoRegression: Forecasting the visual popularity of new fashion products
Stefanos I. Papadopoulos, Christos Koutlis, Symeon Papadopoulos,, Ioannis Kompatsiaris

TL;DR
This paper introduces MuQAR, a deep learning model that predicts the visual popularity of new fashion products by combining multimodal features and quasi-autoregressive time series modeling, addressing data scarcity and trend detection.
Contribution
The paper presents MuQAR, a novel multimodal quasi-autoregressive architecture that effectively forecasts fashion product popularity using visual, textual, and categorical data without requiring historical popularity data.
Findings
MuQAR outperforms current state-of-the-art methods by 4.65% in WAPE.
The model achieves a 4.8% improvement in MAE on the VISUELLE dataset.
Extensive ablation confirms the effectiveness of multimodal and quasi-autoregressive components.
Abstract
Estimating the preferences of consumers is of utmost importance for the fashion industry as appropriately leveraging this information can be beneficial in terms of profit. Trend detection in fashion is a challenging task due to the fast pace of change in the fashion industry. Moreover, forecasting the visual popularity of new garment designs is even more demanding due to lack of historical data. To this end, we propose MuQAR, a Multimodal Quasi-AutoRegressive deep learning architecture that combines two modules: (1) a multi-modal multi-layer perceptron processing categorical, visual and textual features of the product and (2) a quasi-autoregressive neural network modelling the "target" time series of the product's attributes along with the "exogenous" time series of all other attributes. We utilize computer vision, image classification and image captioning, for automatically extracting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFashion and Cultural Textiles · Color perception and design · Aesthetic Perception and Analysis
MethodsMasked autoencoder
