IDEAS home Printed from https://ideas.repec.org/a/eee/proeco/v279y2025ics0925527324002664.html
   My bibliography  Save this article

The big data newsvendor problem under demand and yield uncertainties

Author

Listed:
  • Cao, Tiantian
  • Yang, Yi
  • Zhu, Han
  • Yu, Mingyue

Abstract

We consider a variant of the classic newsvendor problem in which the firms face both demand and yield randomness. Different from the existing literature, we assume that decision-makers have no priori knowledge of the distribution functions of demand and yield, but have access to past observations of demand, yield, and related feature information. We integrate predictive machine learning algorithms to determine the optimal order quantity directly from historical data, respectively based on the empirical risk minimization (ERM) principle, kernel regression approach, K-nearest neighbors (kNN), and classification and regression trees (CART). These data-driven approaches can not only sufficiently capture useful information from relevant features, but also take into account the structure of the optimization problem, which can effectively avoid inconsistency solutions in the traditional “prediction-then-optimization” approach. Most importantly, we establish out-of-sample generalization error bounds under mild conditions using uniform stability-based and Rademacher complexity-based methods in computational learning theory and then show the asymptotic optimality of the data-driven approaches based on kernel regression and kNN. Our data-driven approaches can tractably deal with both independent and interdependent demand and yield uncertainties. Finally, numerical experiments based on both synthetic data and real data are conducted to compare our proposed methods with two traditional benchmark approaches, including the Sample Average Approximation (SAA) approach and the traditional “Predict-then-Optimize” framework based on CART. We observe that our data-driven approaches can achieve significant performance improvement and the one based on the kernel regression method tends to perform the best on real data, with an average daily cost saving of up tp 54.92%.

Suggested Citation

  • Cao, Tiantian & Yang, Yi & Zhu, Han & Yu, Mingyue, 2025. "The big data newsvendor problem under demand and yield uncertainties," International Journal of Production Economics, Elsevier, vol. 279(C).
  • Handle: RePEc:eee:proeco:v:279:y:2025:i:c:s0925527324002664
    DOI: 10.1016/j.ijpe.2024.109409
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0925527324002664
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijpe.2024.109409?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Qi Feng & J. George Shanthikumar, 2018. "Supply and Demand Functions in Inventory Models," Operations Research, INFORMS, vol. 66(1), pages 77-91, 1-2.
    2. Zhang, Jie & Xie, Weijun & Sarin, Subhash C., 2021. "Robust multi-product newsvendor model with uncertain demand and substitution," European Journal of Operational Research, Elsevier, vol. 293(1), pages 190-202.
    3. Beutel, Anna-Lena & Minner, Stefan, 2012. "Safety stock planning under causal demand forecasting," International Journal of Production Economics, Elsevier, vol. 140(2), pages 637-645.
    4. Inderfurth, Karl & Vogelgesang, Stephanie, 2013. "Concepts for safety stock determination under stochastic demand and different types of random production yield," European Journal of Operational Research, Elsevier, vol. 224(2), pages 293-301.
    5. Inderfurth, K. & Kiesmüller, G.P., 2015. "Exact and heuristic linear-inflation policies for an inventory model with random yield and arbitrary lead times," European Journal of Operational Research, Elsevier, vol. 245(1), pages 109-120.
    6. Zhi Chen & Weijun Xie, 2021. "Regret in the Newsvendor Model with Demand and Yield Randomness," Production and Operations Management, Production and Operations Management Society, vol. 30(11), pages 4176-4197, November.
    7. Ravi Anupindi & Ram Akella, 1993. "Diversification Under Supply Uncertainty," Management Science, INFORMS, vol. 39(8), pages 944-963, August.
    8. Retsef Levi & Georgia Perakis & Joline Uichanco, 2015. "The Data-Driven Newsvendor Problem: New Bounds and Insights," Operations Research, INFORMS, vol. 63(6), pages 1294-1306, December.
    9. Maqbool Dada & Nicholas C. Petruzzi & Leroy B. Schwarz, 2007. "A Newsvendor's Procurement Problem when Suppliers Are Unreliable," Manufacturing & Service Operations Management, INFORMS, vol. 9(1), pages 9-32, August.
    10. Yimin Wang & Wendell Gilland & Brian Tomlin, 2010. "Mitigating Supply Risk: Dual Sourcing or Process Improvement?," Manufacturing & Service Operations Management, INFORMS, vol. 12(3), pages 489-510, September.
    11. Nam Ho-Nguyen & Fatma Kılınç-Karzan, 2022. "Risk Guarantees for End-to-End Prediction and Optimization Processes," Management Science, INFORMS, vol. 68(12), pages 8680-8698, December.
    12. Gah-Yi Ban & Cynthia Rudin, 2019. "The Big Data Newsvendor: Practical Insights from Machine Learning," Operations Research, INFORMS, vol. 67(1), pages 90-108, January.
    13. Mengshi Lu & Lun Ran & Zuo-Jun Max Shen, 2015. "Reliable Facility Location Design Under Uncertain Correlated Disruptions," Manufacturing & Service Operations Management, INFORMS, vol. 17(4), pages 445-455, October.
    14. Arthur Hsu & Yehuda Bassok, 1999. "Random Yield and Random Demand in a Production System with Downward Substitution," Operations Research, INFORMS, vol. 47(2), pages 277-290, April.
    15. Sammi Y. Tang & Panos Kouvelis, 2014. "Pay-Back-Revenue-Sharing Contract in Coordinating Supply Chains with Random Yield," Production and Operations Management, Production and Operations Management Society, vol. 23(12), pages 2089-2102, December.
    16. Woonghee Tim Huh & Retsef Levi & Paat Rusmevichientong & James B. Orlin, 2011. "Adaptive Data-Driven Inventory Control with Censored Demand Based on Kaplan-Meier Estimator," Operations Research, INFORMS, vol. 59(4), pages 929-941, August.
    17. Aharon Ben-Tal & Dick den Hertog & Anja De Waegenaere & Bertrand Melenberg & Gijs Rennen, 2013. "Robust Solutions of Optimization Problems Affected by Uncertain Probabilities," Management Science, INFORMS, vol. 59(2), pages 341-357, April.
    18. Anssi Käki & Juuso Liesiö & Ahti Salo & Srinivas Talluri, 2015. "Newsvendor decisions under supply uncertainty," International Journal of Production Research, Taylor & Francis Journals, vol. 53(5), pages 1544-1560, March.
    19. Dimitris Bertsimas & Nathan Kallus, 2020. "From Predictive to Prescriptive Analytics," Management Science, INFORMS, vol. 66(3), pages 1025-1044, March.
    20. Candace Arai Yano & Hau L. Lee, 1995. "Lot Sizing with Random Yields: A Review," Operations Research, INFORMS, vol. 43(2), pages 311-334, April.
    21. Weidong Chen & Cong Shi & Izak Duenyas, 2020. "Optimal Learning Algorithms for Stochastic Inventory Systems with Random Capacities," Production and Operations Management, Production and Operations Management Society, vol. 29(7), pages 1624-1649, July.
    22. Woonghee Tim Huh & Paat Rusmevichientong, 2009. "A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand," Mathematics of Operations Research, INFORMS, vol. 34(1), pages 103-123, February.
    23. Erick Delage & Yinyu Ye, 2010. "Distributionally Robust Optimization Under Moment Uncertainty with Application to Data-Driven Problems," Operations Research, INFORMS, vol. 58(3), pages 595-612, June.
    24. Retsef Levi & Robin O. Roundy & David B. Shmoys, 2007. "Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models," Mathematics of Operations Research, INFORMS, vol. 32(4), pages 821-839, November.
    25. Frank W. Ciarallo & Ramakrishna Akella & Thomas E. Morton, 1994. "A Periodic Review, Production Planning Model with Uncertain Capacity and Uncertain Demand---Optimality of Extended Myopic Policies," Management Science, INFORMS, vol. 40(3), pages 320-332, March.
    26. Mordechai Henig & Yigal Gerchak, 1990. "The Structure of Periodic Review Policies in the Presence of Random Yield," Operations Research, INFORMS, vol. 38(4), pages 634-643, August.
    27. Chao, Xiuli & Chen, Hong & Zheng, Shaohui, 2008. "Joint replenishment and pricing decisions in inventory systems with stochastically dependent supply capacity," European Journal of Operational Research, Elsevier, vol. 191(1), pages 142-155, November.
    28. Lingxiu Dong & Xin Geng & Guang Xiao & Nan Yang, 2022. "Procurement Strategies with Unreliable Suppliers Under Correlated Random Yields," Manufacturing & Service Operations Management, INFORMS, vol. 24(1), pages 179-195, January.
    29. Qi Feng, 2010. "Integrating Dynamic Pricing and Replenishment Decisions Under Supply Capacity Uncertainty," Management Science, INFORMS, vol. 56(12), pages 2154-2172, December.
    30. Pascal M. Notz & Richard Pibernik, 2022. "Prescriptive Analytics for Flexible Capacity Management," Management Science, INFORMS, vol. 68(3), pages 1756-1775, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qi Feng & J. George Shanthikumar, 2018. "Supply and Demand Functions in Inventory Models," Operations Research, INFORMS, vol. 66(1), pages 77-91, 1-2.
    2. Wang, Yimin & Xiao, Yixuan & Yang, Nan, 2014. "Improving reliability of a shared supplier with competition and spillovers," European Journal of Operational Research, Elsevier, vol. 236(2), pages 499-510.
    3. Erkip, Nesim Kohen, 2023. "Can accessing much data reshape the theory? Inventory theory under the challenge of data-driven systems," European Journal of Operational Research, Elsevier, vol. 308(3), pages 949-959.
    4. Wen Chen & Burcu Tan, 2022. "Dynamic procurement from multiple suppliers with random capacities," Annals of Operations Research, Springer, vol. 317(2), pages 509-536, October.
    5. Lingxiu Dong & Guang Xiao & Nan Yang, 2023. "Supply diversification under random yield: The impact of price postponement," Production and Operations Management, Production and Operations Management Society, vol. 32(4), pages 1112-1132, April.
    6. Jinzhi Bu & Xiting Gong & Dacheng Yao, 2019. "Technical Note—Constant-Order Policies for Lost-Sales Inventory Models with Random Supply Functions: Asymptotics and Heuristic," Operations Research, INFORMS, vol. 68(4), pages 1063-1073, July.
    7. Pan, Wenting & So, Kut C. & Xiao, Guang, 2022. "Benefits of backup sourcing for components in assembly systems under supply uncertainty," European Journal of Operational Research, Elsevier, vol. 302(1), pages 158-171.
    8. Serrano, Breno & Minner, Stefan & Schiffer, Maximilian & Vidal, Thibaut, 2024. "Bilevel optimization for feature selection in the data-driven newsvendor problem," European Journal of Operational Research, Elsevier, vol. 315(2), pages 703-714.
    9. Serel, Doğan A., 2017. "A single-period stocking and pricing problem involving stochastic emergency supply," International Journal of Production Economics, Elsevier, vol. 185(C), pages 180-195.
    10. Panos Kouvelis & Guang Xiao & Nan Yang, 2021. "Role of Risk Aversion in Price Postponement Under Supply Random Yield," Management Science, INFORMS, vol. 67(8), pages 4826-4844, August.
    11. Xu, He & Zuo, Xiaolu & Liu, Zhixue, 2015. "Configuration of flexibility strategies under supply uncertainty," Omega, Elsevier, vol. 51(C), pages 71-82.
    12. Seung Hwan Jung, 2020. "Offshore versus Onshore Sourcing: Quick Response, Random Yield, and Competition," Production and Operations Management, Production and Operations Management Society, vol. 29(3), pages 750-766, March.
    13. Qi Feng & Zhongjie Ma & Zhaofang Mao & J. George Shanthikumar, 2021. "Multi‐Stage Supply Chain with Production Uncertainty," Production and Operations Management, Production and Operations Management Society, vol. 30(4), pages 921-940, April.
    14. Xiuli Chao & Xiting Gong & Shaohui Zheng, 2016. "Optimal pricing and inventory policies with reliable and random-yield suppliers: characterization and comparison," Annals of Operations Research, Springer, vol. 241(1), pages 35-51, June.
    15. Ming Zhao & Nickolas Freeman & Kai Pan, 2023. "Robust Sourcing Under Multilevel Supply Risks: Analysis of Random Yield and Capacity," INFORMS Journal on Computing, INFORMS, vol. 35(1), pages 178-195, January.
    16. Soroush Saghafian & Brian Tomlin, 2016. "The Newsvendor under Demand Ambiguity: Combining Data with Moment and Tail Information," Operations Research, INFORMS, vol. 64(1), pages 167-185, February.
    17. Zhi Chen & Weijun Xie, 2021. "Regret in the Newsvendor Model with Demand and Yield Randomness," Production and Operations Management, Production and Operations Management Society, vol. 30(11), pages 4176-4197, November.
    18. Hao Yuan & Qi Luo & Cong Shi, 2021. "Marrying Stochastic Gradient Descent with Bandits: Learning Algorithms for Inventory Systems with Fixed Costs," Management Science, INFORMS, vol. 67(10), pages 6089-6115, October.
    19. Kyungchul Park & Kyungsik Lee, 2016. "Distribution-robust single-period inventory control problem with multiple unreliable suppliers," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 38(4), pages 949-966, October.
    20. Yang, Cheng-Hu & Wang, Hai-Tang & Ma, Xin & Talluri, Srinivas, 2023. "A data-driven newsvendor problem: A high-dimensional and mixed-frequency method," International Journal of Production Economics, Elsevier, vol. 266(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:proeco:v:279:y:2025:i:c:s0925527324002664. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/ijpe .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.