IDEAS home Printed from https://ideas.repec.org/a/inm/orijds/v5y2026i1p65-80.html

An Agglomerative Clustering Algorithm for Simulation Output Distributions Using Regularized Wasserstein Distance

Author

Listed:
  • Mohammadmahdi Ghasemloo

    (Department of Industrial and Systems Engineering, Texas A&M University, College Station, Texas 77843)

  • David J. Eckman

    (Department of Industrial and Systems Engineering, Texas A&M University, College Station, Texas 77843)

Abstract

Using statistical learning methods to analyze stochastic simulation outputs can significantly enhance decision making by uncovering relationships among different simulated systems and between a system’s inputs and outputs. We present a novel agglomerative clustering algorithm that utilizes the regularized Wasserstein distance to cluster multivariate empirical distributions of simulation outputs to identify patterns and trade-offs among performance measures. This framework has several important use cases, including anomaly detection, preoptimization, and online monitoring. In numerical experiments involving a call center model, we demonstrate how this methodology can identify staffing plans that yield similar performance outcomes and inform policies for intervening when queue lengths signal potentially worsening system performance.

Suggested Citation

  • Mohammadmahdi Ghasemloo & David J. Eckman, 2026. "An Agglomerative Clustering Algorithm for Simulation Output Distributions Using Regularized Wasserstein Distance," INFORMS Joural on Data Science, INFORMS, vol. 5(1), pages 65-80, January.
  • Handle: RePEc:inm:orijds:v:5:y:2026:i:1:p:65-80
    DOI: 10.1287/ijds.2024.0056
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijds.2024.0056
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijds.2024.0056?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Abdallah, Imad & Tatsis, Konstantinos & Chatzi, Eleni, 2020. "Unsupervised local cluster-weighted bootstrap aggregating the output from multiple stochastic simulators," Reliability Engineering and System Safety, Elsevier, vol. 199(C).
    2. Lawrence Hubert & Phipps Arabie, 1985. "Comparing partitions," Journal of Classification, Springer;The Classification Society, vol. 2(1), pages 193-218, December.
    3. Haidong Li & Henry Lam & Yijie Peng, 2024. "Efficient Learning for Clustering and Optimizing Context-Dependent Designs," Operations Research, INFORMS, vol. 72(2), pages 617-638, March.
    4. R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
    5. Chakraborty, Saptarshi & Paul, Debolina & Das, Swagatam, 2020. "Hierarchical clustering with optimal transport," Statistics & Probability Letters, Elsevier, vol. 163(C).
    6. B L Nelson, 2016. "‘Some tactical problems in digital simulation’ for the next 10 years," Journal of Simulation, Taylor & Francis Journals, vol. 10(1), pages 2-11, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicoló Musmeci & Tomaso Aste & T Di Matteo, 2015. "Relation between Financial Market Structure and the Real Economy: Comparison between Clustering Methods," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-24, March.
    2. Nicolo Musmeci & Tomaso Aste & Tiziana Di Matteo, 2014. "Relation between Financial Market Structure and the Real Economy: Comparison between Clustering Methods," Papers 1406.0496, arXiv.org, revised Jan 2015.
    3. Andrea Di Iura, 2022. "Comparison of empirical and shrinkage correlation algorithm for clustering methods in the futures market," SN Business & Economics, Springer, vol. 2(8), pages 1-17, August.
    4. Musmeci, Nicoló & Aste, Tomaso & Di Matteo, T., 2015. "Relation between financial market structure and the real economy: comparison between clustering methods," LSE Research Online Documents on Economics 61644, London School of Economics and Political Science, LSE Library.
    5. Abduraimova, Kumushoy, 2022. "Contagion and tail risk in complex financial networks," Journal of Banking & Finance, Elsevier, vol. 143(C).
    6. Josselin Garnier & Knut Sølna, 2018. "Option pricing under fast-varying and rough stochastic volatility," Annals of Finance, Springer, vol. 14(4), pages 489-516, November.
    7. Pieter Nel & Renee van Eyden, 2026. "From News to Noise: Does Media Sentiment Drive Stock Market Volatility?," Working Papers 202605, University of Pretoria, Department of Economics.
    8. Juan C. Henao-Londono & Sebastian M. Krause & Thomas Guhr, 2021. "Price response functions and spread impact in correlated financial markets," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 94(4), pages 1-20, April.
    9. Eduardo Abi Jaber, 2022. "The characteristic function of Gaussian stochastic volatility models: an analytic expression," Working Papers hal-02946146, HAL.
    10. Peter Yegon & W. Brent Lindquist & Svetlozar T. Rachev, 2025. "Asset Pricing in the Presence of Market Microstructure Noise," Papers 2511.00308, arXiv.org.
    11. Wu, Han-Ming & Tien, Yin-Jing & Chen, Chun-houh, 2010. "GAP: A graphical environment for matrix visualization and cluster analysis," Computational Statistics & Data Analysis, Elsevier, vol. 54(3), pages 767-778, March.
    12. José E. Chacón, 2021. "Explicit Agreement Extremes for a 2 × 2 Table with Given Marginals," Journal of Classification, Springer;The Classification Society, vol. 38(2), pages 257-263, July.
    13. Giovanni Bonaccolto & Massimiliano Caporin & Sandra Paterlini, 2018. "Asset allocation strategies based on penalized quantile regression," Computational Management Science, Springer, vol. 15(1), pages 1-32, January.
    14. Zhen-Hao Guo & De-Shuang Huang & Shihua Zhang, 2026. "Multi-species integration, alignment and annotation of single-cell RNA-seq data with CAMEX," Nature Communications, Nature, vol. 17(1), pages 1-17, December.
    15. Eduardo Abi Jaber & Donatien Hainaut & Edouard Motte, 2025. "The Volterra Stein-Stein model with stochastic interest rates," Papers 2503.01716, arXiv.org, revised Jul 2025.
    16. Lallouache, Mehdi & Abergel, Frédéric, 2014. "Tick size reduction and price clustering in a FX order book," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 416(C), pages 488-498.
    17. Muchnik, Lev & Bunde, Armin & Havlin, Shlomo, 2009. "Long term memory in extreme returns of financial time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(19), pages 4145-4150.
    18. BenSaïda, Ahmed & Slim, Skander, 2016. "Highly flexible distributions to fit multiple frequency financial returns," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 442(C), pages 203-213.
    19. Roberto Rocci & Stefano Antonio Gattone & Roberto Di Mari, 2018. "A data driven equivariant approach to constrained Gaussian mixture modeling," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(2), pages 235-260, June.
    20. Chen, Zhimin & Ibragimov, Rustam, 2019. "One country, two systems? The heavy-tailedness of Chinese A- and H- share markets," Emerging Markets Review, Elsevier, vol. 38(C), pages 115-141.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijds:v:5:y:2026:i:1:p:65-80. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.