Data Preprocessing for Evaluation of Recommendation Models in E-Commerce
Namrata Chaudhary, Drimik Roy Chowdhury

TL;DR
This paper presents a comprehensive data preprocessing methodology for evaluating recommendation models in e-commerce, addressing issues from customer behavior to user interface to improve accuracy in measuring recommendation impact.
Contribution
It introduces novel strategies for outlier removal, data collection, and UI analysis to enhance the validity of recommendation evaluation metrics.
Findings
Improved accuracy in measuring recommendation influence on customer actions.
Effective outlier removal techniques for e-commerce data.
Robust methods for tracking visitors and attributing purchases.
Abstract
E-commerce businesses employ recommender models to assist in identifying a personalized set of products for each visitor. To accurately assess the recommendations' influence on customer clicks and buys, three target areas -- customer behavior, data collection, user-interface -- will be explored for possible sources of erroneous data. Varied customer behavior misrepresents the recommendations' true influence on a customer due to the presence of B2B interactions and outlier customers. Non-parametric statistical procedures for outlier removal are delineated and other strategies are investigated to account for the effect of a large percentage of new customers or high bounce rates. Subsequently, in data collection we identify probable misleading interactions in the raw data, propose a robust method of tracking unique visitors, and accurately attributing the buy influence for combo products.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
