IDEAS home Printed from https://ideas.repec.org/a/spr/orspec/v43y2021i3d10.1007_s00291-021-00620-5.html

A column-oriented optimization approach for the generation of correlated random vectors

Author

Listed:
  • Jorge A. Sefair

    (Arizona State University)

  • Oscar Guaje

    (Universidad de los Andes)

  • Andrés L. Medaglia

    (Universidad de los Andes)

Abstract

To induce a desired correlation structure among random variables, widely used simulation software relies upon the method of Iman and Conover (IC). The underlying premise is that the induced Spearman rank correlation is a meaningful way to approximate other correlation measures among the random variables (e.g., Pearson’s correlation). However, as expected, the desired a posteriori correlation structure often deviates from the Spearman correlation structure. Rooted in the same principle as IC, we propose an alternative distribution-free method based on mixed-integer programming to induce a Pearson correlation structure in bivariate or multivariate random vectors. We also extend our distribution-free method to other correlation measures such as Kendall’s coefficient of concordance, the Phi correlation coefficient, and relative risk. We illustrate our method in four different contexts: (1) the simulation of a healthcare facility, (2) the analysis of a manufacturing tandem queue, (3) the imputation of correlated missing data in statistical analysis, and (4) the estimation of the budget overrun risk in a construction project. We also explore the limits of our algorithms by conducting extensive experiments using randomly generated data from multiple distributions.
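
To make the rank-reordering principle mentioned in the abstract concrete, the following is a minimal sketch of a bivariate Iman-Conover-style reordering, not the mixed-integer programming method the article proposes. It uses only the Python standard library. The function names (`pearson`, `ranks`, `iman_conover_2d`), the sample size, the target value 0.8, and the lognormal and exponential marginals are all illustrative assumptions that do not come from the article. The sketch reorders two fixed samples so that their rank pattern follows normal scores with the target correlation, then reports the resulting Spearman and Pearson correlations, which need not coincide for skewed marginals.

```python
import math
import random
from statistics import NormalDist


def pearson(u, v):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    suv = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    suu = sum((a - mu) ** 2 for a in u)
    svv = sum((b - mv) ** 2 for b in v)
    return suv / math.sqrt(suu * svv)


def ranks(u):
    """0-based rank of each element (assumes no ties)."""
    order = sorted(range(len(u)), key=u.__getitem__)
    r = [0] * len(u)
    for pos, idx in enumerate(order):
        r[idx] = pos
    return r


def iman_conover_2d(x, y, rho, rng):
    """Reorder x and y (marginals unchanged) so their ranks follow
    normal scores whose Pearson correlation equals rho.
    Hypothetical helper: a two-variable simplification of IC."""
    n = len(x)
    # Normal (van der Waerden) scores, shuffled independently per column.
    scores = [NormalDist().inv_cdf((i + 1) / (n + 1)) for i in range(n)]
    s1 = scores[:]
    rng.shuffle(s1)
    s2 = scores[:]
    rng.shuffle(s2)
    # Remove the incidental correlation between the two score columns.
    r = pearson(s1, s2)
    e = [(b - r * a) / math.sqrt(1 - r * r) for a, b in zip(s1, s2)]
    # Mix the columns so that corr(s1, t2) equals rho (2x2 Cholesky step).
    t2 = [rho * a + math.sqrt(1 - rho * rho) * b for a, b in zip(s1, e)]
    # Place the sorted samples according to the ranks of the score columns.
    xs, ys = sorted(x), sorted(y)
    x_new = [xs[k] for k in ranks(s1)]
    y_new = [ys[k] for k in ranks(t2)]
    return x_new, y_new


rng = random.Random(42)
n = 2000
x = [rng.lognormvariate(0.0, 1.0) for _ in range(n)]  # assumed marginal
y = [rng.expovariate(1.0) for _ in range(n)]          # assumed marginal

x_new, y_new = iman_conover_2d(x, y, 0.8, rng)
spearman_rho = pearson(ranks(x_new), ranks(y_new))  # rank correlation
pearson_rho = pearson(x_new, y_new)                 # a posteriori Pearson
print(f"Spearman: {spearman_rho:.3f}  Pearson: {pearson_rho:.3f}")
```

Because the reordering only controls ranks, the printed Pearson value depends on the marginal distributions. That gap is the deviation the abstract refers to and the motivation for targeting the Pearson correlation directly.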

Suggested Citation

  • Jorge A. Sefair & Oscar Guaje & Andrés L. Medaglia, 2021. "A column-oriented optimization approach for the generation of correlated random vectors," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 43(3), pages 777-808, September.
  • Handle: RePEc:spr:orspec:v:43:y:2021:i:3:d:10.1007_s00291-021-00620-5
    DOI: 10.1007/s00291-021-00620-5

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00291-021-00620-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



