IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v50y2023i1p120-141.html
   My bibliography  Save this article

Learning the two parameters of the Poisson–Dirichlet distribution with a forensic application

Author

Listed:
  • Giulia Cereda
  • Fabio Corradi
  • Cecilia Viscardi

Abstract

In forensic science, the rare type match problem arises when the matching characteristic from the suspect and the crime scene is not in the reference database; hence, it is difficult to evaluate the likelihood ratio that compares the defense and prosecution hypotheses. A recent solution consists of modeling the ordered population probabilities according to the two‐parameter Poisson–Dirichlet distribution, which is a well‐known Bayesian nonparametric prior, and plugging the maximum likelihood estimates of the parameters into the likelihood ratio. We demonstrate that this approximation produces a systematic bias that fully Bayesian inference avoids. Motivated by this forensic application, we consider the need to learn the posterior distribution of the parameters that governs the two‐parameter Poisson–Dirichlet using two sampling methods: Markov Chain Monte Carlo and approximate Bayesian computation. These methods are evaluated in terms of accuracy and efficiency. Finally, we compare the likelihood ratio that is obtained by our proposal with the existing solution using a database of Y‐chromosome haplotypes.

Suggested Citation

  • Giulia Cereda & Fabio Corradi & Cecilia Viscardi, 2023. "Learning the two parameters of the Poisson–Dirichlet distribution with a forensic application," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 120-141, March.
  • Handle: RePEc:bla:scjsta:v:50:y:2023:i:1:p:120-141
    DOI: 10.1111/sjos.12575
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/sjos.12575
    Download Restriction: no

    File URL: https://libkey.io/10.1111/sjos.12575?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Giulia Cereda, 2017. "Impact of Model Choice on LR Assessment in Case of Rare Haplotype Match (Frequentist Approach)," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(1), pages 230-248, March.
    2. Paul Fearnhead & Dennis Prangle, 2012. "Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 74(3), pages 419-474, June.
    3. Stefano Favaro & Antonio Lijoi & Ramsés H. Mena & Igor Prünster, 2009. "Bayesian nonparametric inference for species variety with a two parameter Poisson-Dirichlet process prior," Carlo Alberto Notebooks 123, Collegio Carlo Alberto.
    4. Giulia Cereda, 2017. "Bayesian approach to LR assessment in case of rare type match," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 71(2), pages 141-164, May.
    5. Mark A. Beaumont & Jean-Marie Cornuet & Jean-Michel Marin & Christian P. Robert, 2009. "Adaptive approximate Bayesian computation," Biometrika, Biometrika Trust, vol. 96(4), pages 983-990.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhang, Junyi & Dassios, Angelos, 2023. "Truncated two-parameter Poisson-Dirichlet approximation for Pitman-Yor process hierarchical models," LSE Research Online Documents on Economics 120294, London School of Economics and Political Science, LSE Library.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xing Ju Lee & Christopher C. Drovandi & Anthony N. Pettitt, 2015. "Model choice problems using approximate Bayesian computation with applications to pathogen transmission data sets," Biometrics, The International Biometric Society, vol. 71(1), pages 198-207, March.
    2. McKinley, Trevelyan J. & Ross, Joshua V. & Deardon, Rob & Cook, Alex R., 2014. "Simulation-based Bayesian inference for epidemic models," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 434-447.
    3. Li, J. & Nott, D.J. & Fan, Y. & Sisson, S.A., 2017. "Extending approximate Bayesian computation methods to high dimensions via a Gaussian copula model," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 77-89.
    4. Gael M. Martin & David T. Frazier & Christian P. Robert, 2020. "Computing Bayes: Bayesian Computation from 1763 to the 21st Century," Monash Econometrics and Business Statistics Working Papers 14/20, Monash University, Department of Econometrics and Business Statistics.
    5. Pierre-Olivier Goffard & Patrick Laub, 2021. "Approximate Bayesian Computations to fit and compare insurance loss models," Working Papers hal-02891046, HAL.
    6. Bertl Johanna & Ewing Gregory & Kosiol Carolin & Futschik Andreas, 2017. "Approximate maximum likelihood estimation for population genetic inference," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 16(5-6), pages 387-405, December.
    7. Anthony Ebert & Ritabrata Dutta & Kerrie Mengersen & Antonietta Mira & Fabrizio Ruggeri & Paul Wu, 2021. "Likelihood‐free parameter estimation for dynamic queueing networks: Case study of passenger flow in an international airport terminal," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 770-792, June.
    8. Gael M. Martin & David T. Frazier & Christian P. Robert, 2021. "Approximating Bayes in the 21st Century," Monash Econometrics and Business Statistics Working Papers 24/21, Monash University, Department of Econometrics and Business Statistics.
    9. Alexander Buchholz & Nicolas CHOPIN, 2017. "Improving approximate Bayesian computation via quasi Monte Carlo," Working Papers 2017-37, Center for Research in Economics and Statistics.
    10. Brenda N Vo & Christopher C Drovandi & Anthony N Pettitt & Graeme J Pettet, 2015. "Melanoma Cell Colony Expansion Parameters Revealed by Approximate Bayesian Computation," PLOS Computational Biology, Public Library of Science, vol. 11(12), pages 1-22, December.
    11. Frazier, David T. & Maneesoonthorn, Worapree & Martin, Gael M. & McCabe, Brendan P.M., 2019. "Approximate Bayesian forecasting," International Journal of Forecasting, Elsevier, vol. 35(2), pages 521-539.
    12. Buzbas, Erkan O. & Rosenberg, Noah A., 2015. "AABC: Approximate approximate Bayesian computation for inference in population-genetic models," Theoretical Population Biology, Elsevier, vol. 99(C), pages 31-42.
    13. Genya Kobayashi & Kazuhiko Kakamu, 2019. "Approximate Bayesian computation for Lorenz curves from grouped data," Computational Statistics, Springer, vol. 34(1), pages 253-279, March.
    14. Kobayashi, Genya, 2014. "A transdimensional approximate Bayesian computation using the pseudo-marginal approach for model choice," Computational Statistics & Data Analysis, Elsevier, vol. 80(C), pages 167-183.
    15. Creel, Michael & Kristensen, Dennis, 2015. "ABC of SV: Limited information likelihood inference in stochastic volatility jump-diffusion models," Journal of Empirical Finance, Elsevier, vol. 31(C), pages 85-108.
    16. Gareth W. Peters & Efstathios Panayi & Francois Septier, 2015. "SMC-ABC methods for the estimation of stochastic simulation models of the limit order book," Papers 1504.05806, arXiv.org.
    17. Mikael Sunnåker & Alberto Giovanni Busetto & Elina Numminen & Jukka Corander & Matthieu Foll & Christophe Dessimoz, 2013. "Approximate Bayesian Computation," PLOS Computational Biology, Public Library of Science, vol. 9(1), pages 1-10, January.
    18. Chiachío, Manuel & Saleh, Ali & Naybour, Susannah & Chiachío, Juan & Andrews, John, 2022. "Reduction of Petri net maintenance modeling complexity via Approximate Bayesian Computation," Reliability Engineering and System Safety, Elsevier, vol. 222(C).
    19. Warne, David J. & Baker, Ruth E. & Simpson, Matthew J., 2018. "Multilevel rejection sampling for approximate Bayesian computation," Computational Statistics & Data Analysis, Elsevier, vol. 124(C), pages 71-86.
    20. Creel, Michael & Kristensen, Dennis, 2016. "On selection of statistics for approximate Bayesian computing (or the method of simulated moments)," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 99-114.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:50:y:2023:i:1:p:120-141. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.