IDEAS home Printed from https://ideas.repec.org/a/spr/comaot/v29y2023i1d10.1007_s10588-021-09351-y.html
   My bibliography  Save this article

Explaining and predicting human behavior and social dynamics in simulated virtual worlds: reproducibility, generalizability, and robustness of causal discovery methods

Author

Listed:
  • Svitlana Volkova

    (Pacific Northwest National Laboratory)

  • Dustin Arendt

    (Pacific Northwest National Laboratory)

  • Emily Saldanha

    (Pacific Northwest National Laboratory)

  • Maria Glenski

    (Pacific Northwest National Laboratory)

  • Ellyn Ayton

    (Pacific Northwest National Laboratory)

  • Joseph Cottam

    (Pacific Northwest National Laboratory)

  • Sinan Aksoy

    (Pacific Northwest National Laboratory)

  • Brett Jefferson

    (Pacific Northwest National Laboratory)

  • Karthnik Shrivaram

    (Pacific Northwest National Laboratory)

Abstract

Ground Truth program was designed to evaluate social science modeling approaches using simulation test beds with ground truth intentionally and systematically embedded to understand and model complex Human Domain systems and their dynamics Lazer et al. (Science 369:1060–1062, 2020). Our multidisciplinary team of data scientists, statisticians, experts in Artificial Intelligence (AI) and visual analytics had a unique role on the program to investigate accuracy, reproducibility, generalizability, and robustness of the state-of-the-art (SOTA) causal structure learning approaches applied to fully observed and sampled simulated data across virtual worlds. In addition, we analyzed the feasibility of using machine learning models to predict future social behavior with and without causal knowledge explicitly embedded. In this paper, we first present our causal modeling approach to discover the causal structure of four virtual worlds produced by the simulation teams—Urban Life, Financial Governance, Disaster and Geopolitical Conflict. Our approach adapts the state-of-the-art causal discovery (including ensemble models), machine learning, data analytics, and visualization techniques to allow a human-machine team to reverse-engineer the true causal relations from sampled and fully observed data. We next present our reproducibility analysis of two research methods team’s performance using a range of causal discovery models applied to both sampled and fully observed data, and analyze their effectiveness and limitations. We further investigate the generalizability and robustness to sampling of the SOTA causal discovery approaches on additional simulated datasets with known ground truth. Our results reveal the limitations of existing causal modeling approaches when applied to large-scale, noisy, high-dimensional data with unobserved variables and unknown relationships between them. We show that the SOTA causal models explored in our experiments are not designed to take advantage from vasts amounts of data and have difficulty recovering ground truth when latent confounders are present; they do not generalize well across simulation scenarios and are not robust to sampling; they are vulnerable to data and modeling assumptions, and therefore, the results are hard to reproduce. Finally, when we outline lessons learned and provide recommendations to improve models for causal discovery and prediction of human social behavior from observational data, we highlight the importance of learning data to knowledge representations or transformations to improve causal discovery and describe the benefit of causal feature selection for predictive and prescriptive modeling.

Suggested Citation

  • Svitlana Volkova & Dustin Arendt & Emily Saldanha & Maria Glenski & Ellyn Ayton & Joseph Cottam & Sinan Aksoy & Brett Jefferson & Karthnik Shrivaram, 2023. "Explaining and predicting human behavior and social dynamics in simulated virtual worlds: reproducibility, generalizability, and robustness of causal discovery methods," Computational and Mathematical Organization Theory, Springer, vol. 29(1), pages 220-241, March.
  • Handle: RePEc:spr:comaot:v:29:y:2023:i:1:d:10.1007_s10588-021-09351-y
    DOI: 10.1007/s10588-021-09351-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10588-021-09351-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10588-021-09351-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Watts, Duncan J & Beck, Emorie D & Bienenstock, Elisa Jayne & Bowers, Jake & Frank, Aaron & Grubesic, Anthony & Hofman, Jake M. & Rohrer, Julia Marie & Salganik, Matthew, 2018. "Explanation, prediction, and causality: Three sides of the same coin?," OSF Preprints u6vz5, Center for Open Science.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:comaot:v:29:y:2023:i:1:d:10.1007_s10588-021-09351-y. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.