Modeling household online shopping demand in the U.S.: a machine learning approach and comparative investigation between 2009 and 2017

Author

Listed:
  • Limon Barua

    (University of Illinois Chicago)

  • Bo Zou

    (University of Illinois Chicago; University of California)

  • Yan Zhou

    (Argonne National Laboratory)

  • Yulin Liu

    (University of California)

Abstract

Despite the rapid growth of online shopping and research interest in the relationship between online and in-store shopping, national-level modeling of the demand for online shopping with a prediction focus remains limited in the literature. This paper leverages two recent releases of the U.S. National Household Travel Survey (NHTS), for 2009 and 2017, to develop machine learning (ML) models, specifically gradient boosting machines (GBM), for predicting household-level online shopping purchases. The NHTS data allow investigation not only nationwide but also at the household level, which is more appropriate than the individual level given the connected consumption and shopping needs of household members. We follow a systematic procedure for model development, including employing the Recursive Feature Elimination algorithm to select input variables (features), in order to reduce the risk of model overfitting and increase model explainability. Among several ML models, GBM is found to yield the best prediction accuracy. Extensive post-modeling investigation is conducted in a comparative manner between 2009 and 2017, including quantifying the importance of each input variable in predicting online shopping demand and characterizing value-dependent relationships between demand and the input variables. In doing so, two recent advances in machine learning, namely Shapley value-based feature importance and Accumulated Local Effects plots, are adopted to overcome inherent drawbacks of the techniques popular in current ML modeling. The modeling and investigation, performed at the national level, yield a number of findings. The models developed and insights gained can be used for online shopping-related freight demand generation and may also be considered for evaluating the potential impact of relevant policies on online shopping demand.
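
For readers unfamiliar with Shapley value-based feature importance, the idea can be illustrated with a minimal sketch. It is not the authors' implementation: the toy demand function, the feature names (income, household size, urban), and the choice to represent an "absent" feature by a fixed baseline value are all assumptions made here for illustration, and the exact enumeration over feature orderings is only practical for a handful of features.

```python
from itertools import permutations


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict(x) relative to predict(baseline).

    Assumption: a feature that is "absent" from a coalition keeps its
    baseline value. Cost grows factorially with the number of features.
    """
    n = len(x)
    phi = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(baseline)      # start with every feature absent
        prev = predict(current)
        for j in order:
            current[j] = x[j]         # switch feature j to its actual value
            new = predict(current)
            phi[j] += new - prev      # marginal contribution of feature j
            prev = new
    return [total / len(orderings) for total in phi]


def toy_demand(features):
    """Hypothetical stand-in for a fitted model (not from the paper)."""
    income, household_size, urban = features
    return 2.0 * income + 1.5 * household_size + 0.5 * income * urban


x = [3.0, 4.0, 1.0]         # household being explained (made-up values)
baseline = [1.0, 2.0, 0.0]  # reference household (made-up values)
phi = shapley_values(toy_demand, x, baseline)
print(dict(zip(["income", "household_size", "urban"], phi)))
```

The per-feature contributions add up to the difference between the prediction for the household and the prediction for the baseline, which is the property that makes Shapley values attractive as an importance measure.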

Suggested Citation

  • Limon Barua & Bo Zou & Yan Zhou & Yulin Liu, 2023. "Modeling household online shopping demand in the U.S.: a machine learning approach and comparative investigation between 2009 and 2017," Transportation, Springer, vol. 50(2), pages 437-476, April.
  • Handle: RePEc:kap:transp:v:50:y:2023:i:2:d:10.1007_s11116-021-10250-z
    DOI: 10.1007/s11116-021-10250-z

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11116-021-10250-z
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shi, Kunbo & De Vos, Jonas & Yang, Yongchun & Witlox, Frank, 2019. "Does e-shopping replace shopping trips? Empirical evidence from Chengdu, China," Transportation Research Part A: Policy and Practice, Elsevier, vol. 122(C), pages 21-33.
    2. Mateos-Mínguez, Paloma & Arranz-López, Aldo & Soria-Lara, Julio A. & Lanzendorf, Martin, 2021. "E-shoppers and multimodal accessibility to in-store retail: An analysis of spatial and social effects," Journal of Transport Geography, Elsevier, vol. 96(C).
    3. Shah, Harsh & Carrel, Andre L. & Le, Huyen T.K., 2021. "What is your shopping travel style? Heterogeneity in US households’ online shopping and travel," Transportation Research Part A: Policy and Practice, Elsevier, vol. 153(C), pages 83-98.
    4. Kunbo Shi & Long Cheng & Jonas De Vos & Yongchun Yang & Wanpeng Cao & Frank Witlox, 2021. "How does purchasing intangible services online influence the travel to consume these services? A focus on a Chinese context," Transportation, Springer, vol. 48(5), pages 2605-2625, October.
    5. Shi, Kunbo & De Vos, Jonas & Cheng, Long & Yang, Yongchun & Witlox, Frank, 2021. "The influence of the built environment on online purchases of intangible services: Examining the mediating role of online purchase attitudes," Transport Policy, Elsevier, vol. 114(C), pages 116-126.
    6. Li, Shengxiao (Alex), 2023. "Revisiting the relationship between information and communication technologies and travel behavior: An investigation of older Americans," Transportation Research Part A: Policy and Practice, Elsevier, vol. 172(C).
    7. Jing Chen & Yong Zhang & Shiyao Zhu & Lei Liu, 2021. "Does COVID-19 Affect the Behavior of Buying Fresh Food? Evidence from Wuhan, China," IJERPH, MDPI, vol. 18(9), pages 1-15, April.
    8. Figliozzi, Miguel & Unnikrishnan, Avinash, 2021. "Exploring the impact of socio-demographic characteristics, health concerns, and product type on home delivery rates and expenditures during a strict COVID-19 lockdown period: A case study from Portlan," Transportation Research Part A: Policy and Practice, Elsevier, vol. 153(C), pages 1-19.
    9. Shi, Kunbo & Shao, Rui & De Vos, Jonas & Cheng, Long & Witlox, Frank, 2021. "Is e-shopping likely to reduce shopping trips for car owners? A propensity score matching analysis," Journal of Transport Geography, Elsevier, vol. 95(C).
    10. Minh Hieu Nguyen & Jimmy Armoogum & Binh Nguyen Thi, 2021. "Factors Affecting the Growth of E-Shopping over the COVID-19 Era in Hanoi, Vietnam," Sustainability, MDPI, vol. 13(16), pages 1-21, August.
    11. Lee, Richard J. & Sener, Ipek N. & Mokhtarian, Patricia L. & Handy, Susan L., 2017. "Relationships between the online and in-store shopping frequency of Davis, California residents," Transportation Research Part A: Policy and Practice, Elsevier, vol. 100(C), pages 40-52.
    12. Colaço, Rui & de Abreu e Silva, João, 2022. "Exploring the e-shopping geography of Lisbon: Assessing online shopping adoption for retail purchases and food deliveries using a 7-day shopping survey," Journal of Retailing and Consumer Services, Elsevier, vol. 65(C).
    13. Xi, Guangliang & Cao, Xinyu & Zhen, Feng, 2020. "The impacts of same day delivery online shopping on local store shopping in Nanjing, China," Transportation Research Part A: Policy and Practice, Elsevier, vol. 136(C), pages 35-47.
    14. Lavieri, Patrícia S. & Dai, Qichun & Bhat, Chandra R., 2018. "Using virtual accessibility and physical accessibility as joint predictors of activity-travel behavior," Transportation Research Part A: Policy and Practice, Elsevier, vol. 118(C), pages 527-544.
    15. Yu Ding & Huapu Lu, 2017. "The interactions between online shopping and personal activity travel behavior: an analysis with a GPS-based activity travel diary," Transportation, Springer, vol. 44(2), pages 311-324, March.
    16. Comi, Antonio, 2020. "A modelling framework to forecast urban goods flows," Research in Transportation Economics, Elsevier, vol. 80(C).
    17. Zhen, Feng & Du, Xiaojuan & Cao, Jason & Mokhtarian, Patricia L., 2018. "The association between spatial attributes and e-shopping in the shopping process for search goods and experience goods: Evidence from Nanjing," Journal of Transport Geography, Elsevier, vol. 66(C), pages 291-299.
    18. Orit Rotem-Mindali & Jesse Weltevreden, 2013. "Transport effects of e-commerce: what can be learned after years of research?," Transportation, Springer, vol. 40(5), pages 867-885, September.
    19. Shi, Yishao & Tao, Tianhui & Cao, Xiangyang & Pei, Xiaowen, 2021. "The association between spatial attributes and neighborhood characteristics based on Meituan take-out data: Evidence from shanghai business circles," Journal of Retailing and Consumer Services, Elsevier, vol. 58(C).
    20. Beckers, Joris & Cárdenas, Ivan & Verhetsel, Ann, 2018. "Identifying the geography of online shopping adoption in Belgium," Journal of Retailing and Consumer Services, Elsevier, vol. 45(C), pages 33-41.
