IDEAS home Printed from https://ideas.repec.org/a/spr/metrik/v82y2019i3d10.1007_s00184-018-0689-5.html
   My bibliography  Save this article

A modification of Chao’s lower bound estimator in the case of one-inflation

Author

Listed:
  • Dankmar Böhning

    (University of Southampton)

  • Panicha Kaskasamkul

    (Naresuan University)

  • Peter G. M. Heijden

    (University of Southampton
    University of Utrecht)

Abstract

For zero-truncated count data, as they typically arise in capture-recapture modelling, the nonparametric lower bound estimator of Chao is a frequently used estimator of population size. It is a simple, nonparametric estimator involving only counts of one and counts of two. The estimator is asymptotically unbiased if the count distribution is a member of the power series family and is providing a lower bound estimator if the distribution is a mixture of a member of the power series family. However, if there is one-inflation Chao’s estimator can severely overestimate as we show here. This is also illustrated by routinely collected country-wide data on family violence in the Netherlands. A new lower bound estimator is developed which involves only counts of twos and threes, thus avoiding the overestimation caused by one-inflation. We show that the new estimator is asymptotically unbiased for a power series distribution with and without one-inflation and provides a lower bound estimator under a mixture of power series distributions with and without one-inflation. For all estimators bias-adjusted versions are developed that reduce the bias considerably when the sample size is small. A simulation study compares the modified Chao estimator with the conventional estimator as well as with an estimator suggested by Chiu and Chao more recently.

Suggested Citation

  • Dankmar Böhning & Panicha Kaskasamkul & Peter G. M. Heijden, 2019. "A modification of Chao’s lower bound estimator in the case of one-inflation," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 82(3), pages 361-384, April.
  • Handle: RePEc:spr:metrik:v:82:y:2019:i:3:d:10.1007_s00184-018-0689-5
    DOI: 10.1007/s00184-018-0689-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00184-018-0689-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00184-018-0689-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chun-Huo Chiu & Yi-Ting Wang & Bruno A. Walther & Anne Chao, 2014. "An improved nonparametric lower bound of species richness via a modified good–turing frequency formula," Biometrics, The International Biometric Society, vol. 70(3), pages 671-682, September.
    2. Pedro Puig & Célestin C. Kokonendji, 2018. "Non†parametric Estimation of the Number of Zeros in Truncated Count Distributions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 45(2), pages 347-365, June.
    3. Amy Willis & John Bunge, 2015. "Estimating diversity via frequency ratios," Biometrics, The International Biometric Society, vol. 71(4), pages 1042-1049, December.
    4. Mao, Chang Xuan, 2006. "Inference on the Number of Species Through Geometric Lower Bounds," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1663-1670, December.
    5. William A. Link, 2003. "Nonidentifiability of Population Size from Capture-Recapture Data with Heterogeneous Detection Probabilities," Biometrics, The International Biometric Society, vol. 59(4), pages 1123-1130, December.
    6. Wang, Ji-Ping Z. & Lindsay, Bruce G., 2005. "A Penalized Nonparametric Maximum Likelihood Approach to Species Richness Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 942-959, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dankmar Böhning & Rattana Lerdsuwansri & Patarawan Sangnawakij, 2023. "Modeling COVID‐19 contact‐tracing using the ratio regression capture–recapture approach," Biometrics, The International Biometric Society, vol. 79(4), pages 3818-3830, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Farcomeni, Alessio & Dotto, Francesco, 2021. "A correction to make Chao estimator conservative when the number of sampling occasions is finite," Statistics & Probability Letters, Elsevier, vol. 176(C).
    2. Chang Xuan Mao & Cuiying Yang & Yitong Yang & Wei Zhuang, 2017. "Estimating population sizes with the Rasch model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(3), pages 705-716, June.
    3. Dankmar Böhning & Alberto Vidal-Diez & Rattana Lerdsuwansri & Chukiat Viwatwongkasem & Mark Arnold, 2013. "A Generalization of Chao's Estimator for Covariate Information," Biometrics, The International Biometric Society, vol. 69(4), pages 1033-1042, December.
    4. Orasa Anan & Dankmar Böhning & Antonello Maruotti, 2017. "Population size estimation and heterogeneity in capture–recapture data: a linear regression estimator based on the Conway–Maxwell–Poisson distribution," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(1), pages 49-79, March.
    5. Stilianos Louca & Florent Mazel & Michael Doebeli & Laura Wegener Parfrey, 2019. "A census-based estimate of Earth's bacterial and archaeal diversity," PLOS Biology, Public Library of Science, vol. 17(2), pages 1-30, February.
    6. Sa-aat Niwitpong & Dankmar Böhning & Peter Heijden & Heinz Holling, 2013. "Capture–recapture estimation based upon the geometric distribution allowing for heterogeneity," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(4), pages 495-519, May.
    7. Paul S. F. Yip & Hua-Zhen Lin & Liqun Xi, 2005. "A Semiparametric Method for Estimating Population Size for Capture–Recapture Experiments with Random Covariates in Continuous Time," Biometrics, The International Biometric Society, vol. 61(4), pages 1085-1092, December.
    8. Ján Kulfan & Lenka Sarvašová & Michal Parák & Marek Dzurenko & Peter Zach, 2018. "Can late flushing trees avoid attack by moth larvae in temperate forests?," Plant Protection Science, Czech Academy of Agricultural Sciences, vol. 54(4), pages 272-283.
    9. Chang Xuan Mao & Na You, 2009. "On Comparison of Mixture Models for Closed Population Capture–Recapture Studies," Biometrics, The International Biometric Society, vol. 65(2), pages 547-553, June.
    10. repec:jss:jstsof:40:i09 is not listed on IDEAS
    11. Hajo Holzmann & Axel Munk & Walter Zucchini, 2006. "On Identifiability in Capture–Recapture Models," Biometrics, The International Biometric Society, vol. 62(3), pages 934-936, September.
    12. repec:jss:jstsof:19:i05 is not listed on IDEAS
    13. J. Andrew Royle, 2009. "Analysis of Capture–Recapture Models with Individual Covariates Using Data Augmentation," Biometrics, The International Biometric Society, vol. 65(1), pages 267-274, March.
    14. Zhang, Hongmei & Ghosh, Kaushik & Ghosh, Pulak, 2012. "Sampling designs via a multivariate hypergeometric-Dirichlet process model for a multi-species assemblage with unknown heterogeneity," Computational Statistics & Data Analysis, Elsevier, vol. 56(8), pages 2562-2573.
    15. Huggins, Richard, 2007. "On the use of linear models in the estimation of the size of a population using capture-recapture data," Statistics & Probability Letters, Elsevier, vol. 77(6), pages 649-653, March.
    16. Seungchul Baek & Junyong Park, 2022. "A computationally efficient approach to estimating species richness and rarefaction curve," Computational Statistics, Springer, vol. 37(4), pages 1919-1941, September.
    17. Olivier Binette & Rebecca C. Steorts, 2022. "On the reliability of multiple systems estimation for the quantification of modern slavery," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(2), pages 640-676, April.
    18. Louis-Paul Rivest & Sophie Baillargeon, 2007. "Applications and Extensions of Chao's Moment Estimator for the Size of a Closed Population," Biometrics, The International Biometric Society, vol. 63(4), pages 999-1006, December.
    19. Robert M. Dorazio & Bhramar Mukherjee & Li Zhang & Malay Ghosh & Howard L. Jelks & Frank Jordan, 2008. "Modeling Unobserved Sources of Heterogeneity in Animal Abundance Using a Dirichlet Process Prior," Biometrics, The International Biometric Society, vol. 64(2), pages 635-644, June.
    20. J. Andrew Royle, 2006. "Site Occupancy Models with Heterogeneous Detection Probabilities," Biometrics, The International Biometric Society, vol. 62(1), pages 97-102, March.
    21. William A. Link, 2006. "The author replied as follows:," Biometrics, The International Biometric Society, vol. 62(3), pages 936-939, September.
    22. Balabdaoui, Fadoua & Kulagina, Yulia, 2020. "Completely monotone distributions: Mixing, approximation and estimation of number of species," Computational Statistics & Data Analysis, Elsevier, vol. 150(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metrik:v:82:y:2019:i:3:d:10.1007_s00184-018-0689-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.