IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2504.16789.html
   My bibliography  Save this paper

MLOps Monitoring at Scale for Digital Platforms

Author

Listed:
  • Yu Jeffrey Hu
  • Jeroen Rombouts
  • Ines Wilms

Abstract

Machine learning models are widely recognized for their strong performance in forecasting. To keep that performance in streaming data settings, they have to be monitored and frequently re-trained. This can be done with machine learning operations (MLOps) techniques under supervision of an MLOps engineer. However, in digital platform settings where the number of data streams is typically large and unstable, standard monitoring becomes either suboptimal or too labor intensive for the MLOps engineer. As a consequence, companies often fall back on very simple worse performing ML models without monitoring. We solve this problem by adopting a design science approach and introducing a new monitoring framework, the Machine Learning Monitoring Agent (MLMA), that is designed to work at scale for any ML model with reasonable labor cost. A key feature of our framework concerns test-based automated re-training based on a data-adaptive reference loss batch. The MLOps engineer is kept in the loop via key metrics and also acts, pro-actively or retrospectively, to maintain performance of the ML model in the production stage. We conduct a large-scale test at a last-mile delivery platform to empirically validate our monitoring framework.

Suggested Citation

  • Yu Jeffrey Hu & Jeroen Rombouts & Ines Wilms, 2025. "MLOps Monitoring at Scale for Digital Platforms," Papers 2504.16789, arXiv.org.
  • Handle: RePEc:arx:papers:2504.16789
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2504.16789
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. M. Keith Chen & Judith A. Chevalier & Peter E. Rossi & Emily Oehlsen, 2019. "The Value of Flexible Work: Evidence from Uber Drivers," Journal of Political Economy, University of Chicago Press, vol. 127(6), pages 2735-2794.
    2. Mirko Kremer & Brent Moritz & Enno Siemsen, 2011. "Demand Forecasting Behavior: System Neglect and Change Detection," Management Science, INFORMS, vol. 57(10), pages 1827-1843, October.
    3. Hansen, Lars Peter, 1982. "Large Sample Properties of Generalized Method of Moments Estimators," Econometrica, Econometric Society, vol. 50(4), pages 1029-1054, July.
    4. Barbara Rossi, 2021. "Forecasting in the Presence of Instabilities: How We Know Whether Models Predict Well and How to Improve Them," Journal of Economic Literature, American Economic Association, vol. 59(4), pages 1135-1190, December.
    5. Andreas Fügener & Jörn Grahl & Alok Gupta & Wolfgang Ketter, 2022. "Cognitive Challenges in Human–Artificial Intelligence Collaboration: Investigating the Path Toward Productive Delegation," Information Systems Research, INFORMS, vol. 33(2), pages 678-696, June.
    6. Raffaella Giacomini & Barbara Rossi, 2009. "Detecting and Predicting Forecast Breakdowns," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 76(2), pages 669-705.
    7. Lan Luo & Ling Zhou & Peter X.-K. Song, 2023. "Real-Time Regression Analysis of Streaming Clustered Data With Possible Abnormal Data Batches," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 2029-2044, July.
    8. Bowen Gang & Wenguang Sun & Weinan Wang, 2023. "Structure–Adaptive Sequential Testing for Online False Discovery Rate Control," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 732-745, January.
    9. Makridakis, Spyros & Spiliotis, Evangelos & Assimakopoulos, Vassilios, 2022. "The M5 competition: Background, organization, and implementation," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1325-1336.
    10. Jiankun Sun & Dennis J. Zhang & Haoyuan Hu & Jan A. Van Mieghem, 2022. "Predicting Human Discretion to Adjust Algorithmic Prescription: A Large-Scale Field Experiment in Warehouse Operations," Management Science, INFORMS, vol. 68(2), pages 846-865, February.
    11. Holger Dette & Josua Gösmann, 2020. "A Likelihood Ratio Approach to Sequential Change Point Detection for a General Class of Parameters," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1361-1377, July.
    12. Helfat, Constance E. & Raubitschek, Ruth S., 2018. "Dynamic and integrative capabilities for profiting from innovation in digital platform-based ecosystems," Research Policy, Elsevier, vol. 47(8), pages 1391-1399.
    13. Lan Luo & Peter X.‐K. Song, 2020. "Renewable estimation and incremental inference in generalized linear models with streaming data sets," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(1), pages 69-97, February.
    14. Weiguang Wang & Guodong (Gordon) Gao & Ritu Agarwal, 2024. "Friend or Foe? Teaming Between Artificial Intelligence and Workers with Variation in Experience," Management Science, INFORMS, vol. 70(9), pages 5753-5775, September.
    15. Bojer, Casper Solheim & Meldgaard, Jens Peder, 2021. "Kaggle forecasting competitions: An overlooked learning opportunity," International Journal of Forecasting, Elsevier, vol. 37(2), pages 587-603.
    16. Gordon Burtch & Seth Carnahan & Brad N. Greenwood, 2018. "Can You Gig It? An Empirical Examination of the Gig Economy and Entrepreneurial Activity," Management Science, INFORMS, vol. 64(12), pages 5497-5520, December.
    17. Jin-Ting Zhang & Jia Guo & Bu Zhou & Ming-Yen Cheng, 2020. "A Simple Two-Sample Test in High Dimensions Based on L2-Norm," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 1011-1027, April.
    18. Y. Mei, 2010. "Efficient scalable schemes for monitoring a large number of data streams," Biometrika, Biometrika Trust, vol. 97(2), pages 419-433.
    19. Simona Abis & Laura Veldkamp, 2024. "The Changing Economics of Knowledge Production," The Review of Financial Studies, Society for Financial Studies, vol. 37(1), pages 89-118.
    20. Sturm, Timo & Gerlach, Jin & Pumplun, Luisa & Mesbah, Neda & Peters, Felix & Tauchert, Christoph & Nan, Ning & Buxmann, Peter, 2021. "Coordinating Human and Machine Learning for Effective Organizational Learning," Publications of Darmstadt Technical University, Institute for Business Studies (BWL) 125653, Darmstadt Technical University, Department of Business Administration, Economics and Law, Institute for Business Studies (BWL).
    21. Tamer Boyacı & Caner Canyakmaz & Francis de Véricourt, 2024. "Human and Machine: The Impact of Machine Input on Decision Making Under Cognitive Limitations," Management Science, INFORMS, vol. 70(2), pages 1258-1275, February.
    22. Chu, Chia-Shang James & Stinchcombe, Maxwell & White, Halbert, 1996. "Monitoring Structural Change," Econometrica, Econometric Society, vol. 64(5), pages 1045-1065, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yu Jeffrey Hu & Jeroen Rombouts & Ines Wilms, 2023. "Fast Forecasting of Unstable Data Streams for On-Demand Service Platforms," Papers 2303.01887, arXiv.org, revised May 2024.
    2. Ivanov, Dmitry, 2023. "Intelligent digital twin (iDT) for supply chain stress-testing, resilience, and viability," International Journal of Production Economics, Elsevier, vol. 263(C).
    3. Spiliotis, Evangelos & Petropoulos, Fotios, 2024. "On the update frequency of univariate forecasting models," European Journal of Operational Research, Elsevier, vol. 314(1), pages 111-121.
    4. Sungyong Um & Bin Zhang & Sunil Wattal & Youngjin Yoo, 2023. "Software Components and Product Variety in a Platform Ecosystem: A Dynamic Network Analysis of WordPress," Information Systems Research, INFORMS, vol. 34(4), pages 1339-1374, December.
    5. Barrios, John M. & Hochberg, Yael V. & Yi, Hanyi, 2022. "Launching with a parachute: The gig economy and new business formation," Journal of Financial Economics, Elsevier, vol. 144(1), pages 22-43.
    6. Yongwook Paik & Christos A. Makridis, 2023. "The social value of a ridesharing platform: a hedonic pricing approach," Empirical Economics, Springer, vol. 64(5), pages 2125-2150, May.
    7. Yudong Chen & Tengyao Wang & Richard J. Samworth, 2022. "High‐dimensional, multiscale online changepoint detection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(1), pages 234-266, February.
    8. Clark, Todd & McCracken, Michael, 2013. "Advances in Forecast Evaluation," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1107-1201, Elsevier.
    9. Makridakis, Spyros & Spiliotis, Evangelos & Assimakopoulos, Vassilios & Chen, Zhi & Gaba, Anil & Tsetlin, Ilia & Winkler, Robert L., 2022. "The M5 uncertainty competition: Results, findings and conclusions," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1365-1385.
    10. Weiguang Wang & Guodong (Gordon) Gao & Ritu Agarwal, 2024. "Friend or Foe? Teaming Between Artificial Intelligence and Workers with Variation in Experience," Management Science, INFORMS, vol. 70(9), pages 5753-5775, September.
    11. Mikkel Bennedsen, 2021. "Designing a statistical procedure for monitoring global carbon dioxide emissions," Climatic Change, Springer, vol. 166(3), pages 1-19, June.
    12. Chen, Yudong & Wang, Tengyao & Samworth, Richard J., 2022. "High-dimensional, multiscale online changepoint detection," LSE Research Online Documents on Economics 113665, London School of Economics and Political Science, LSE Library.
    13. Minkyu Shin & Jiwoong Shin & Soheil Ghili & Jaehwan Kim, 2023. "The Impact of the Gig Economy on Product Quality Through the Labor Market: Evidence from Ridesharing and Restaurant Quality," Management Science, INFORMS, vol. 69(5), pages 2620-2638, May.
    14. Jin Liu & Xingchen Xu & Xi Nan & Yongjun Li & Yong Tan, 2023. ""Generate" the Future of Work through AI: Empirical Evidence from Online Labor Markets," Papers 2308.05201, arXiv.org, revised Jun 2024.
    15. Ni Huang & Gordon Burtch & Yili Hong & Paul A. Pavlou, 2020. "Unemployment and Worker Participation in the Gig Economy: Evidence from an Online Labor Market," Information Systems Research, INFORMS, vol. 31(2), pages 431-448, June.
    16. Qi Zheng & Jing Zhan & Xinying Xu, 2024. "Platform Training and Learning by Doing and Gig Workers’ Incomes: Empirical Evidence From China’s Food Delivery Riders," SAGE Open, , vol. 14(3), pages 21582440241, September.
    17. Dawei (David) Zhang & Gang Peng & Yuliang Yao & Tyson R. Browning, 2024. "Is a College Education Still Enough? The IT-Labor Relationship with Education Level, Task Routineness, and Artificial Intelligence," Information Systems Research, INFORMS, vol. 35(3), pages 992-1010, September.
    18. Chiu, Ching-Wai (Jeremy) & Hayes, Simon & Kapetanios, George & Theodoridis, Konstantinos, 2019. "A new approach for detecting shifts in forecast accuracy," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1596-1612.
    19. Yiyuan Ma & Ke Chen & Youzhi Xiao & Rong Fan, 2022. "Does Online Ride-Hailing Service Improve the Efficiency of Taxi Market? Evidence from Shanghai," Sustainability, MDPI, vol. 14(14), pages 1-16, July.
    20. Teltser, Keith & Lennon, Conor & Burgdorf, Jacob, 2021. "Do ridesharing services increase alcohol consumption?," Journal of Health Economics, Elsevier, vol. 77(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2504.16789. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.