Printed from https://ideas.repec.org/a/vrs/demode/v5y2017i1p268-294n16.html

A joint regression modeling framework for analyzing bivariate binary data in R

Author

Listed:
  • Marra Giampiero

    (Department of Statistical Science, University College London, Gower Street, London WC1E 6BT, UK)

  • Radice Rosalba

    (Department of Economics, Mathematics and Statistics, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK)

Abstract

We discuss some of the features of the R add-on package GJRM, which implements a flexible joint modeling framework for fitting a number of multivariate response regression models under various sampling schemes. In particular, we focus on the case in which the user wishes to fit bivariate binary regression models in the presence of several forms of selection bias. The framework allows for Gaussian and non-Gaussian dependencies through the use of copulae, and for the association and mean parameters to depend on flexible functions of covariates. We describe some of the methodological details underpinning the bivariate binary models implemented in the package and illustrate them by fitting interpretable models of different complexity on three data sets.
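As a rough illustration of how a copula links two binary margins, the sketch below computes the four joint cell probabilities of a bivariate binary response from two linear predictors. It is not the GJRM implementation: the probit margins, the choice of the Clayton copula as the non-Gaussian example, and all names (`probit_cdf`, `clayton_copula`, `joint_probabilities`, `eta1`, `eta2`, `theta`) are assumptions made for this example only.

```python
import math


def probit_cdf(eta):
    """Standard normal CDF, used here as the inverse probit link."""
    return 0.5 * (1.0 + math.erf(eta / math.sqrt(2.0)))


def clayton_copula(u, v, theta):
    """Clayton copula C(u, v; theta) for theta > 0 and u, v in (0, 1)."""
    if theta <= 0:
        raise ValueError("theta must be positive for the Clayton copula")
    return (u ** (-theta) + v ** (-theta) - 1.0) ** (-1.0 / theta)


def joint_probabilities(eta1, eta2, theta):
    """Cell probabilities P(y1 = a, y2 = b) for a, b in {0, 1}.

    eta1 and eta2 are the linear predictors of the two equations
    (hypothetical values here); theta is the association parameter.
    """
    p1 = probit_cdf(eta1)               # marginal P(y1 = 1)
    p2 = probit_cdf(eta2)               # marginal P(y2 = 1)
    p11 = clayton_copula(p1, p2, theta)  # joint P(y1 = 1, y2 = 1)
    return {
        (1, 1): p11,
        (1, 0): p1 - p11,
        (0, 1): p2 - p11,
        (0, 0): 1.0 - p1 - p2 + p11,
    }


# Illustrative values only; in a fitted model eta1, eta2 and theta
# would each be functions of covariates.
probs = joint_probabilities(eta1=0.3, eta2=-0.5, theta=2.0)
for cell, p in sorted(probs.items()):
    print(cell, round(p, 4))
```

Replacing `clayton_copula` with another copula function changes the dependence structure while leaving the two marginal probabilities untouched, which is the separation between margins and association that the abstract describes.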

Suggested Citation

  • Marra Giampiero & Radice Rosalba, 2017. "A joint regression modeling framework for analyzing bivariate binary data in R," Dependence Modeling, De Gruyter, vol. 5(1), pages 268-294, December.
  • Handle: RePEc:vrs:demode:v:5:y:2017:i:1:p:268-294:n:16
    DOI: 10.1515/demo-2017-0016

    Download full text from publisher

    File URL: https://doi.org/10.1515/demo-2017-0016
    Download Restriction: no

    File URL: https://libkey.io/10.1515/demo-2017-0016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marra, Giampiero & Wyszynski, Karol, 2016. "Semi-parametric copula sample selection models for count responses," Computational Statistics & Data Analysis, Elsevier, vol. 104(C), pages 110-129.
    2. Wojtyś, Małgorzata & Marra, Giampiero & Radice, Rosalba, 2018. "Copula based generalized additive models for location, scale and shape with non-random sample selection," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 1-14.
    3. Karol Wyszynski & Giampiero Marra, 2018. "Sample selection models for count data in R," Computational Statistics, Springer, vol. 33(3), pages 1385-1412, September.
    4. Marra, Giampiero & Radice, Rosalba, 2017. "Bivariate copula additive models for location, scale and shape," Computational Statistics & Data Analysis, Elsevier, vol. 112(C), pages 99-113.
    5. Giampiero Marra & Rosalba Radice & Till Bärnighausen & Simon N. Wood & Mark E. McGovern, 2017. "A Simultaneous Equation Approach to Estimating HIV Prevalence With Nonignorable Missing Responses," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 484-496, April.
    6. Giampiero Marra & Rosalba Radice & Silvia Missiroli, 2014. "Testing the hypothesis of absence of unobserved confounding in semiparametric bivariate probit models," Computational Statistics, Springer, vol. 29(3), pages 715-741, June.
    7. James J. Heckman, 2008. "Econometric Causality," International Statistical Review, International Statistical Institute, vol. 76(1), pages 1-27, April.
    8. Schmidt, Rouven & Kneib, Thomas, 2023. "Multivariate distributional stochastic frontier models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    9. Nadja Klein & Thomas Kneib & Giampiero Marra & Rosalba Radice & Slawa Rokicki & Mark E. McGovern, 2018. "Mixed Binary-Continuous Copula Regression Models with Application to Adverse Birth Outcomes," CHaRMS Working Papers 18-06, Centre for HeAlth Research at the Management School (CHaRMS).
    10. Maike Hohberg & Francesco Donat & Giampiero Marra & Thomas Kneib, 2021. "Beyond unidimensional poverty analysis using distributional copula models for mixed ordered‐continuous outcomes," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(5), pages 1365-1390, November.
    11. Nathaniel E. Helwig, 2022. "Robust Permutation Tests for Penalized Splines," Stats, MDPI, vol. 5(3), pages 1-18, September.
    12. Seebens, Holger, 2009. "Child Welfare and Old-Age Security in Female Headed Households in Tanzania," IZA Discussion Papers 3929, Institute of Labor Economics (IZA).
    13. Maarten Goos & Anna Salomons, 2017. "Measuring teaching quality in higher education: assessing selection bias in course evaluations," Research in Higher Education, Springer;Association for Institutional Research, vol. 58(4), pages 341-364, June.
    14. Marra, Giampiero & Radice, Rosalba, 2013. "Estimation of a regression spline sample selection model," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 158-173.
    15. James J. Heckman, 2005. "Micro Data, Heterogeneity and the Evaluation of Public Policy Part 2," The American Economist, Sage Publications, vol. 49(1), pages 16-44, March.
    16. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2017. "Économétrie et Machine Learning," Papers 1708.06992, arXiv.org, revised Mar 2018.
    17. Verbeek, M.J.C.M. & Nijman, T.E., 1992. "Incomplete panels and selection bias : A survey," Discussion Paper 1992-7, Tilburg University, Center for Economic Research.
    18. McGovern, Mark E. & Canning, David & Bärnighausen, Till, 2018. "Accounting for non-response bias using participation incentives and survey design: An application using gift vouchers," Economics Letters, Elsevier, vol. 171(C), pages 239-244.
    19. Ben-Halima, B. & Chusseau, N. & Hellier, J., 2014. "Skill premia and intergenerational education mobility: The French case," Economics of Education Review, Elsevier, vol. 39(C), pages 50-64.
    20. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
