IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v70y2008i4p725-738.html
   My bibliography  Save this article

Non‐parametric inference for clustered binary and count data when only summary information is available

Author

Listed:
  • Peter Hall
  • Tapabrata Maiti

Abstract

Summary. Data in the form of pairs (X,Y), where the response Y is a count, arise in many applications, including problems involving stratified or two‐stage sampling. Such data are often analysed by using random‐effects models, where the distribution of Y, conditional on X and on an unobserved random parameter Θ, is taken to be either binomial or Poisson, and the distribution of Θ is connected through a link function to a random effect. The latter is sometimes supposed to be normally distributed, but that assumption can lead to serious biases if it is not satisfied. This difficulty has motivated an extensive literature on non‐parametric techniques for cases where the random‐effect distribution is unknown, but that methodology requires detailed cluster level data. No non‐parametric techniques are available for instances where only summary data are accessible. We show that the random‐effect distribution is actually non‐parametrically identifiable from crude summary data, and we suggest a sieve method for inference in this case. Non‐parametric approaches to both parameter estimation and prediction are introduced, and empirical techniques are developed for choosing tuning parameters. These methods are shown to outperform their parametric counterparts when the model is misspecified, and to perform almost as well as parametric methods when the model is correct.

Suggested Citation

  • Peter Hall & Tapabrata Maiti, 2008. "Non‐parametric inference for clustered binary and count data when only summary information is available," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(4), pages 725-738, September.
  • Handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:725-738
    DOI: 10.1111/j.1467-9868.2008.00658.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2008.00658.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2008.00658.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Huageng Tao & Mari Palta & Brian S. Yandell & Michael A. Newton, 1999. "An Estimation Method for the Semiparametric Mixed Effects Model," Biometrics, The International Biometric Society, vol. 55(1), pages 102-110, March.
    2. Delaigle, A. & Gijbels, I., 2004. "Practical bandwidth selection in deconvolution kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 45(2), pages 249-267, March.
    3. A. Delaigle & I. Gijbels, 2002. "Estimation of integrated squared density derivatives from a contaminated sample," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 869-886, October.
    4. Peter Hall & Tapabrata Maiti, 2006. "On parametric bootstrap methods for small area prediction," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(2), pages 221-238, April.
    5. Bert Van Es & Hae‐Won Uh, 2005. "Asymptotic Normality of Kernel‐Type Deconvolution Estimators," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 32(3), pages 467-483, September.
    6. Wendimagegn Ghidey & Emmanuel Lesaffre & Paul Eilers, 2004. "Smooth Random Effects Distribution in a Linear Mixed Model," Biometrics, The International Biometric Society, vol. 60(4), pages 945-953, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Peter Hall & Tapabrata Maiti, 2009. "Deconvolution methods for non‐parametric inference in two‐level mixed models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 703-718, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Staudenmayer, John & Ruppert, David & Buonaccorsi, John P., 2008. "Density Estimation in the Presence of Heteroscedastic Measurement Error," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 726-736, June.
    2. Yousri Slaoui, 2021. "Data-driven Deconvolution Recursive Kernel Density Estimators Defined by Stochastic Approximation Method," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 312-352, February.
    3. Bissantz, Nicolai & Dümbgen, Lutz & Holzmann, Hajo & Munk, Axel, 2007. "Nonparametric confidence bands in deconvolution density estimation," Technical Reports 2007,03, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    4. Ali Al-Sharadqah & Majid Mojirsheibani & William Pouliot, 2020. "On the performance of weighted bootstrapped kernel deconvolution density estimators," Statistical Papers, Springer, vol. 61(4), pages 1773-1798, August.
    5. Yang Zu, 2015. "A Note on the Asymptotic Normality of the Kernel Deconvolution Density Estimator with Logarithmic Chi-Square Noise," Econometrics, MDPI, vol. 3(3), pages 1-16, July.
    6. Peter Hall & Tapabrata Maiti, 2009. "Deconvolution methods for non‐parametric inference in two‐level mixed models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 703-718, June.
    7. Parmeter, Christopher F., 2008. "The effect of measurement error on the estimated shape of the world distribution of income," Economics Letters, Elsevier, vol. 100(3), pages 373-376, September.
    8. Kato, Kengo & Sasaki, Yuya, 2018. "Uniform confidence bands in deconvolution with unknown error distribution," Journal of Econometrics, Elsevier, vol. 207(1), pages 129-161.
    9. Holzmann, Hajo & Bissantz, Nicolai & Munk, Axel, 2007. "Density testing in a contaminated sample," Journal of Multivariate Analysis, Elsevier, vol. 98(1), pages 57-75, January.
    10. Dong, Hao & Otsu, Taisuke & Taylor, Luke, 2021. "Average Derivative Estimation Under Measurement Error," Econometric Theory, Cambridge University Press, vol. 37(5), pages 1004-1033, October.
    11. Rendao Ye & Tonghui Wang & Saowanit Sukparungsee & Arjun Gupta, 2015. "Tests in variance components models under skew-normal settings," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 78(7), pages 885-904, October.
    12. Otsu, Taisuke & Taylor, Luke, 2021. "Specification Testing For Errors-In-Variables Models," Econometric Theory, Cambridge University Press, vol. 37(4), pages 747-768, August.
    13. William Horrace & Christopher Parmeter, 2011. "Semiparametric deconvolution with unknown error variance," Journal of Productivity Analysis, Springer, vol. 35(2), pages 129-141, April.
    14. Zeinolabedin Najafi & Karim Zare & Mohammad Reza Mahmoudi & Soheil Shokri & Amir Mosavi, 2022. "Inference and Local Influence Assessment in a Multifactor Skew-Normal Linear Mixed Model," Mathematics, MDPI, vol. 10(15), pages 1-21, August.
    15. Martin L. Hazelton & Berwin A. Turlach, 2010. "Semiparametric Density Deconvolution," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 37(1), pages 91-108, March.
    16. Cornelis J. Potgieter, 2020. "Density deconvolution for generalized skew-symmetric distributions," Journal of Statistical Distributions and Applications, Springer, vol. 7(1), pages 1-20, December.
    17. Delaigle, A. & Gijbels, I., 2006. "Data-driven boundary estimation in deconvolution problems," Computational Statistics & Data Analysis, Elsevier, vol. 50(8), pages 1965-1994, April.
    18. Ho, Remus K.W. & Hu, Inchi, 2008. "Flexible modelling of random effects in linear mixed models--A Bayesian approach," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1347-1361, January.
    19. Kubokawa, Tatsuya & Nagashima, Bui, 2012. "Parametric bootstrap methods for bias correction in linear mixed models," Journal of Multivariate Analysis, Elsevier, vol. 106(C), pages 1-16.
    20. Kuosmanen, Timo & Johnson, Andrew, 2017. "Modeling joint production of multiple outputs in StoNED: Directional distance function approach," European Journal of Operational Research, Elsevier, vol. 262(2), pages 792-801.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:70:y:2008:i:4:p:725-738. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.