IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v50y2021i3p1259-1283.html
   My bibliography  Save this article

Multiple Imputation Using Gaussian Copulas

Author

Listed:
  • Florian M. Hollenbach
  • Iavor Bojinov
  • Shahryar Minhas
  • Nils W. Metternich
  • Michael D. Ward
  • Alexander Volfovsky

Abstract

Missing observations are pervasive throughout empirical research, especially in the social sciences. Despite multiple approaches to dealing adequately with missing data, many scholars still fail to address this vital issue. In this article, we present a simple-to-use method for generating multiple imputations (MIs) using a Gaussian copula. The Gaussian copula for MI allows scholars to attain estimation results that have good coverage and small bias. The use of copulas to model the dependence among variables will enable researchers to construct valid joint distributions of the data, even without knowledge of the actual underlying marginal distributions. MIs are then generated by drawing observations from the resulting posterior joint distribution and replacing the missing values. Using simulated and observational data from published social science research, we compare imputation via Gaussian copulas with two other widely used imputation methods: multiple imputation via chained equations and Amelia II. Our results suggest that the Gaussian copula approach has a slightly smaller bias, higher coverage rates, and narrower confidence intervals compared to the other methods. This is especially true when the variables with missing data are not normally distributed. These results, combined with theoretical guarantees and ease of use, suggest that the approach examined provides an attractive alternative for applied researchers undertaking MIs.

Suggested Citation

  • Florian M. Hollenbach & Iavor Bojinov & Shahryar Minhas & Nils W. Metternich & Michael D. Ward & Alexander Volfovsky, 2021. "Multiple Imputation Using Gaussian Copulas," Sociological Methods & Research, , vol. 50(3), pages 1259-1283, August.
  • Handle: RePEc:sae:somere:v:50:y:2021:i:3:p:1259-1283
    DOI: 10.1177/0049124118799381
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124118799381
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124118799381?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Yucel, Recai M., 2011. "State of the Multiple Imputation Software," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i01).
    2. Kropko, Jonathan & Goodrich, Ben & Gelman, Andrew & Hill, Jennifer, 2014. "Multiple Imputation for Continuous and Categorical Data: Comparing Joint Multivariate Normal and Conditional Approaches," Political Analysis, Cambridge University Press, vol. 22(4), pages 497-519.
    3. Fabrizia Mealli & Donald B. Rubin, 2015. "Clarifying missing at random and related definitions, and implications when coupled with exchangeability," Biometrika, Biometrika Trust, vol. 102(4), pages 995-1000.
    4. King, Gary & Honaker, James & Joseph, Anne & Scheve, Kenneth, 2001. "Analyzing Incomplete Political Science Data: An Alternative Algorithm for Multiple Imputation," American Political Science Review, Cambridge University Press, vol. 95(1), pages 49-69, March.
    5. Michael W. Robbins & Sujit K. Ghosh & Joshua D. Habiger, 2013. "Imputation in High-Dimensional Economic Data as Applied to the Agricultural Resource Management Survey," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(501), pages 81-95, March.
    6. F. Di Lascio & Simone Giannerini & Alessandra Reale, 2015. "Exploring copulas for the imputation of complex dependent data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(1), pages 159-175, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Burns, Christopher & Prager, Daniel & Ghosh, Sujit & Goodwin, Barry, 2015. "Imputing for Missing Data in the ARMS Household Section: A Multivariate Imputation Approach," 2015 AAEA & WAEA Joint Annual Meeting, July 26-28, San Francisco, California 205291, Agricultural and Applied Economics Association.
    2. Josse, Julie & Husson, François, 2016. "missMDA: A Package for Handling Missing Values in Multivariate Data Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 70(i01).
    3. Schalk Burger & Searle Silverman & Gary van Vuuren, 2018. "Deriving Correlation Matrices for Missing Financial Time-Series Data," International Journal of Economics and Finance, Canadian Center of Science and Education, vol. 10(10), pages 105-105, October.
    4. Sophia Rabe-Hesketh & Anders Skrondal, 2023. "Ignoring Non-ignorable Missingness," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 31-50, March.
    5. Scott Gehlbach & Konstantin Sonin & Ekaterina Zhuravskaya, 2010. "Businessman Candidates," American Journal of Political Science, John Wiley & Sons, vol. 54(3), pages 718-736, July.
    6. Ihle, R. & Amikuzuno, J. & von Cramon-Taubadel, S. & Zorya, S., 2010. "Grenzeffekte in der Marktintegration bei Mais in Ostafrika: Einsichten aus einem semi-parametrischen Regressionsmodell," Proceedings “Schriften der Gesellschaft für Wirtschafts- und Sozialwissenschaften des Landbaues e.V.”, German Association of Agricultural Economists (GEWISOLA), vol. 45, March.
    7. Matthew Blackwell & James Honaker & Gary King, 2017. "A Unified Approach to Measurement Error and Missing Data: Overview and Applications," Sociological Methods & Research, , vol. 46(3), pages 303-341, August.
    8. Vincent Bauer & Keven Ruby & Robert Pape, 2017. "Solving the Problem of Unattributed Political Violence," Journal of Conflict Resolution, Peace Science Society (International), vol. 61(7), pages 1537-1564, August.
    9. Paul Poast, 2013. "Issue linkage and international cooperation: An empirical investigation," Conflict Management and Peace Science, Peace Science Society (International), vol. 30(3), pages 286-303, July.
    10. Cohen, Joseph N, 2010. "Neoliberalism’s relationship with economic growth in the developing world: Was it the power of the market or the resolution of financial crisis?," MPRA Paper 24527, University Library of Munich, Germany.
    11. You, Jong-Sung & Khagram, Sanjeev, 2004. "Inequality and Corruption," Working Paper Series rwp04-001, Harvard University, John F. Kennedy School of Government.
    12. Sergei Guriev & Daniel Treisman, 2020. "The Popularity of Authoritarian Leaders: A cross-national investigation," Post-Print hal-03878626, HAL.
    13. Julia Cage & Yasmine Bekkouche, 2018. "The Price of a Vote: Evidence from France, 1993-2014," Sciences Po publications 12614, Sciences Po.
    14. Bruno Versailles, 2012. "Market Integration and Border Effects in Eastern Africa," Economics Series Working Papers WPS/2012-01, University of Oxford, Department of Economics.
    15. Antonio Filippin & Luca Nunziata, 2019. "Monetary effects of inequality: lessons from the euro experiment," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 17(2), pages 99-124, June.
    16. Robert Grafstein, 2009. "Antisocial Security: The Puzzle of Beggar‐Thy‐Children Policies," American Journal of Political Science, John Wiley & Sons, vol. 53(3), pages 710-725, July.
    17. Wurriehausen, Nadine & Ihle, Rico & Lakner, Sebastian, 2011. "The Integration of the Conventional and Organic Wheat Market," 2011 International Congress, August 30-September 2, 2011, Zurich, Switzerland 115784, European Association of Agricultural Economists.
    18. Sebastian Barfort & Nikolaj Harmon & Frederik Hjorth & Asmus Leth Olsen, 2015. "Dishonesty and Selection into Public Service in Denmark: Who Runs the World’s Least Corrupt Public Sector?," Discussion Papers 15-12, University of Copenhagen. Department of Economics.
    19. Manthos D. Delis & Iftekhar Hasan & Pantelis Kazakis, 2014. "Bank Regulations and Income Inequality: Empirical Evidence," Review of Finance, European Finance Association, vol. 18(5), pages 1811-1846.
    20. Alessandro Bitetto & Paola Cerchiello & Charilaos Mertzanis, 2021. "A data-driven approach to measuring epidemiological susceptibility risk around the world," DEM Working Papers Series 200, University of Pavia, Department of Economics and Management.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:50:y:2021:i:3:p:1259-1283. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.