Modeling Household Online Shopping Demand in the U.S.: A Machine Learning Approach and Comparative Investigation between 2009 and 2017
Limon Barua, Bo Zou, Yan (Joann) Zhou, Yulin Liu

TL;DR
This study develops and compares machine learning models to predict household online shopping demand in the U.S. using 2009 and 2017 data, providing insights into demand drivers at national and city levels.
Contribution
It introduces a systematic ML modeling approach with feature selection and advanced interpretability techniques for analyzing online shopping demand changes over time.
Findings
Identified key variables influencing online shopping demand.
Quantified demand changes between 2009 and 2017.
Provided city-level demand insights for three major U.S. cities.
Abstract
Despite the rapid growth of online shopping and research interest in the relationship between online and in-store shopping, national-level modeling and investigation of the demand for online shopping with a prediction focus remain limited in the literature. This paper differs from prior work and leverages two recent releases of the U.S. National Household Travel Survey (NHTS) data for 2009 and 2017 to develop machine learning (ML) models, specifically gradient boosting machine (GBM), for predicting household-level online shopping purchases. The NHTS data allow for not only conducting nationwide investigation but also at the level of households, which is more appropriate than at the individual level given the connected consumption and shopping needs of members in a household. We follow a systematic procedure for model development including employing Recursive Feature Elimination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUrban and Freight Transport Logistics · Transportation Planning and Optimization · Human Mobility and Location-Based Analysis
MethodsEmirates Airlines Office in Dubai
