IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v15y2023i2d10.1007_s12561-023-09368-8.html
   My bibliography  Save this article

Bayesian Analysis of Multivariate Matched Proportions with Sparse Response

Author

Listed:
  • Mark J. Meyer

    (Georgetown University)

  • Haobo Cheng

    (Georgetown University)

  • Katherine Hobbs Knutson

    (Duke University School of Medicine)

Abstract

Multivariate matched proportions (MMP) data appear in a variety of contexts including post-market surveillance of adverse events in pharmaceuticals, disease classification, and agreement between care providers. It consists of multiple sets of paired binary measurements taken on the same subject. While recent work proposes methods to address the complexities of MMP data, the issue of sparse response, where no or very few “yes” responses are recorded for one or more sets, is unaddressed. The presence of sparse response sets results in the underestimation of variance components, loss of coverage, and lowered power in existing methods. Bayesian methods, which have not previously been considered for MMP data, provide a useful framework when sparse responses are present. In particular, the Bayesian probit model in combination with mean model prior specifications provides an elegant solution to the problem of variance underestimation. We examine a multivariate probit-based approach using hierarchical horseshoe-like priors along with a Bayesian functional principal component analysis (FPCA) to model the latent covariance. We show that our approach performs well on MMP data with sparse responses and outperforms existing methods. In a re-examination of a study on the system of care (SOC) framework for children with mental and behavioral disorders, we are able to provide a more complete picture of the relationships in the data. Our analysis provides additional insights into the functioning on the SOC that a previous univariate analysis missed.

Suggested Citation

  • Mark J. Meyer & Haobo Cheng & Katherine Hobbs Knutson, 2023. "Bayesian Analysis of Multivariate Matched Proportions with Sparse Response," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(2), pages 490-509, July.
  • Handle: RePEc:spr:stabio:v:15:y:2023:i:2:d:10.1007_s12561-023-09368-8
    DOI: 10.1007/s12561-023-09368-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-023-09368-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-023-09368-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Consonni, Guido & La Rocca, Luca, 2008. "Tests Based on Intrinsic Priors for the Equality of Two Correlated Proportions," Journal of the American Statistical Association, American Statistical Association, vol. 103(483), pages 1260-1269.
    2. Carlos M. Carvalho & Nicholas G. Polson & James G. Scott, 2010. "The horseshoe estimator for sparse signals," Biometrika, Biometrika Trust, vol. 97(2), pages 465-480.
    3. Quinn McNemar, 1947. "Note on the sampling error of the difference between correlated proportions or percentages," Psychometrika, Springer;The Psychometric Society, vol. 12(2), pages 153-157, June.
    4. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521785167.
    5. Wolfgang Härdle & Oliver Linton & Wang & Qihua, 2003. "Semiparametric regression analysis with missing response at random," CeMMAP working papers 11/03, Institute for Fiscal Studies.
    6. Jeff Goldsmith & Tomoko Kitago, 2016. "Assessing systematic effects of stroke on motor control by using hierarchical function-on-scalar regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 65(2), pages 215-236, February.
    7. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521780506.
    8. Peter H. Westfall & James F. Troendle & Gene Pennello, 2010. "Multiple McNemar Tests," Biometrics, The International Biometric Society, vol. 66(4), pages 1185-1191, December.
    9. van der Linde, Angelika, 2008. "Variational Bayesian functional PCA," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 517-533, December.
    10. Webb, Emily L. & Forster, Jonathan J., 2008. "Bayesian model determination for multivariate ordinal and binary data," Computational Statistics & Data Analysis, Elsevier, vol. 52(5), pages 2632-2649, January.
    11. Inyoung Kim & Noah D. Cohen & Raymond J. Carroll, 2003. "Semiparametric Regression Splines in Matched Case-Control Studies," Biometrics, The International Biometric Society, vol. 59(4), pages 1158-1169, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kalogridis, Ioannis & Van Aelst, Stefan, 2023. "Robust penalized estimators for functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    2. Morteza Amini & Mahdi Roozbeh & Nur Anisah Mohamed, 2024. "Separation of the Linear and Nonlinear Covariates in the Sparse Semi-Parametric Regression Model in the Presence of Outliers," Mathematics, MDPI, vol. 12(2), pages 1-17, January.
    3. Giancarlo Aquila & Lucas Barros Scianni Morais & Victor Augusto Durães de Faria & José Wanderley Marangon Lima & Luana Medeiros Marangon Lima & Anderson Rodrigo de Queiroz, 2023. "An Overview of Short-Term Load Forecasting for Electricity Systems Operational Planning: Machine Learning Methods and the Brazilian Experience," Energies, MDPI, vol. 16(21), pages 1-35, November.
    4. Øystein Sørensen & Anders M. Fjell & Kristine B. Walhovd, 2023. "Longitudinal Modeling of Age-Dependent Latent Traits with Generalized Additive Latent and Mixed Models," Psychometrika, Springer;The Psychometric Society, vol. 88(2), pages 456-486, June.
    5. Katarzyna Reluga & María‐José Lombardía & Stefan Sperlich, 2023. "Simultaneous inference for linear mixed model parameters with an application to small area estimation," International Statistical Review, International Statistical Institute, vol. 91(2), pages 193-217, August.
    6. Zanin, Luca, 2023. "A flexible estimation of sectoral portfolio exposure to climate transition risks in the European stock market," Journal of Behavioral and Experimental Finance, Elsevier, vol. 39(C).
    7. Gao, Lisa & Shi, Peng, 2022. "Leveraging high-resolution weather information to predict hail damage claims: A spatial point process for replicated point patterns," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 161-179.
    8. Yu Liu & Chin-Shang Li, 2023. "A linear spline Cox cure model with its applications," Computational Statistics, Springer, vol. 38(2), pages 935-954, June.
    9. Caldeira, João F. & Santos, André A.P. & Torrent, Hudson S., 2023. "Semiparametric portfolios: Improving portfolio performance by exploiting non-linearities in firm characteristics," Economic Modelling, Elsevier, vol. 122(C).
    10. Benjamin Owusu & Bettina Bökemeier & Alfred Greiner, 2023. "Assessing nonlinearities and heterogeneity in debt sustainability analysis: a panel spline approach," Empirical Economics, Springer, vol. 64(3), pages 1315-1346, March.
    11. Hamdy F. F. Mahmoud & Inyoung Kim, 2023. "Semiparametric Integrated and Additive Spatio-Temporal Single-Index Models," Mathematics, MDPI, vol. 11(22), pages 1-15, November.
    12. Otto-Sobotka, Fabian & Salvati, Nicola & Ranalli, Maria Giovanna & Kneib, Thomas, 2019. "Adaptive semiparametric M-quantile regression," Econometrics and Statistics, Elsevier, vol. 11(C), pages 116-129.
    13. Timothy K.M. Beatty & Erling Røed Larsen, 2005. "Using Engel curves to estimate bias in the Canadian CPI as a cost of living index," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 38(2), pages 482-499, May.
    14. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2017. "Econom\'etrie et Machine Learning," Papers 1708.06992, arXiv.org, revised Mar 2018.
    15. Hyunju Son & Youyi Fong, 2021. "Fast grid search and bootstrap‐based inference for continuous two‐phase polynomial regression models," Environmetrics, John Wiley & Sons, Ltd., vol. 32(3), May.
    16. Michael Wegener & Göran Kauermann, 2017. "Forecasting in nonlinear univariate time series using penalized splines," Statistical Papers, Springer, vol. 58(3), pages 557-576, September.
    17. Dlugosz, Stephan & Mammen, Enno & Wilke, Ralf A., 2017. "Generalized partially linear regression with misclassified data and an application to labour market transitions," Computational Statistics & Data Analysis, Elsevier, vol. 110(C), pages 145-159.
    18. Bernhard Baumgartner & Daniel Guhl & Thomas Kneib & Winfried J. Steiner, 2018. "Flexible estimation of time-varying effects for frequently purchased retail goods: a modeling approach based on household panel data," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 40(4), pages 837-873, October.
    19. Zi Ye & Giles Hooker & Stephen P. Ellner, 2021. "Generalized Single Index Models and Jensen Effects on Reproduction and Survival," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(3), pages 492-512, September.
    20. Ferraccioli, Federico & Sangalli, Laura M. & Finos, Livio, 2022. "Some first inferential tools for spatial regression with differential regularization," Journal of Multivariate Analysis, Elsevier, vol. 189(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:15:y:2023:i:2:d:10.1007_s12561-023-09368-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.