IDEAS home Printed from https://ideas.repec.org/a/vrs/demode/v7y2019i1p45-52n3.html
   My bibliography  Save this article

A simple proof of Pitman–Yor’s Chinese restaurant process from its stick-breaking representation

Author

Listed:
  • Lawless Caroline
  • Arbel Julyan

    (Univ. Grenoble Alpes, Inria, CNRS, LJK, 38000Grenoble, France)

Abstract

For a long time, the Dirichlet process has been the gold standard discrete random measure in Bayesian nonparametrics. The Pitman-Yor process provides a simple and mathematically tractable generalization, allowing for a very flexible control of the clustering behaviour. Two commonly used representations of the Pitman-Yor process are the stick-breaking process and the Chinese restaurant process. The former is a constructive representation of the process which turns out very handy for practical implementation, while the latter describes the partition distribution induced. Obtaining one from the other is usually done indirectly with use of measure theory. In contrast, we propose here an elementary proof of Pitman-Yor’s Chinese Restaurant process from its stick-breaking representation.

Suggested Citation

  • Lawless Caroline & Arbel Julyan, 2019. "A simple proof of Pitman–Yor’s Chinese restaurant process from its stick-breaking representation," Dependence Modeling, De Gruyter, vol. 7(1), pages 45-52, March.
  • Handle: RePEc:vrs:demode:v:7:y:2019:i:1:p:45-52:n:3
    DOI: 10.1515/demo-2019-0003
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/demo-2019-0003
    Download Restriction: no

    File URL: https://libkey.io/10.1515/demo-2019-0003?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Giovanni De Luca & Paola Zuccolotto, 2011. "A tail dependence-based dissimilarity measure for financial time series clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 5(4), pages 323-340, December.
    2. Bruno Scarpa & David B. Dunson, 2009. "Bayesian Hierarchical Functional Data Analysis Via Contaminated Informative Priors," Biometrics, The International Biometric Society, vol. 65(3), pages 772-780, September.
    3. Yang Ni & Peter Müller & Yitan Zhu & Yuan Ji, 2018. "Heterogeneous reciprocal graphical models," Biometrics, The International Biometric Society, vol. 74(2), pages 606-615, June.
    4. Stefano Favaro & Antonio Lijoi & Ramsés H. Mena & Igor Prünster, 2009. "Bayesian non‐parametric inference for species variety with a two‐parameter Poisson–Dirichlet process prior," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 993-1008, November.
    5. Ishwaran H. & James L. F, 2001. "Gibbs Sampling Methods for Stick Breaking Priors," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 161-173, March.
    6. Ghosal,Subhashis & van der Vaart,Aad, 2017. "Fundamentals of Nonparametric Bayesian Inference," Cambridge Books, Cambridge University Press, number 9780521878265.
    7. Miller, Jeffrey W., 2019. "An elementary derivation of the Chinese restaurant process from Sethuraman’s stick-breaking process," Statistics & Probability Letters, Elsevier, vol. 146(C), pages 112-117.
    8. Bassetti, Federico & Casarin, Roberto & Leisen, Fabrizio, 2014. "Beta-product dependent Pitman–Yor processes for Bayesian inference," Journal of Econometrics, Elsevier, vol. 180(1), pages 49-72.
    9. Stefano Favaro & Antonio Lijoi & Ramsés H. Mena & Igor Prünster, 2009. "Bayesian nonparametric inference for species variety with a two parameter Poisson-Dirichlet process prior," Carlo Alberto Notebooks 123, Collegio Carlo Alberto.
    10. A. Canale & A. Lijoi & B. Nipoti & I. Prünster, 2017. "On the Pitman–Yor process with spike and slab base measure," Biometrika, Biometrika Trust, vol. 104(3), pages 681-697.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Weixuan Zhu & Fabrizio Leisen, 2015. "A multivariate extension of a vector of two-parameter Poisson-Dirichlet processes," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 27(1), pages 89-105, March.
    2. Julyan Arbel & Stefano Favaro, 2021. "Approximating Predictive Probabilities of Gibbs-Type Priors," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 496-519, February.
    3. Daniel R. Kowal & Antonio Canale, 2021. "Semiparametric Functional Factor Models with Bayesian Rank Selection," Papers 2108.02151, arXiv.org, revised May 2022.
    4. Zhang, Junyi & Dassios, Angelos, 2023. "Truncated two-parameter Poisson-Dirichlet approximation for Pitman-Yor process hierarchical models," LSE Research Online Documents on Economics 120294, London School of Economics and Political Science, LSE Library.
    5. Laura Liu & Hyungsik Roger Moon & Frank Schorfheide, 2023. "Forecasting with a panel Tobit model," Quantitative Economics, Econometric Society, vol. 14(1), pages 117-159, January.
    6. Giulia Cereda, 2017. "Impact of Model Choice on LR Assessment in Case of Rare Haplotype Match (Frequentist Approach)," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(1), pages 230-248, March.
    7. repec:dau:papers:123456789/13437 is not listed on IDEAS
    8. Canale, Antonio & Lijoi, Antonio & Nipoti, Bernardo & Prünster, Igor, 2023. "Inner spike and slab Bayesian nonparametric models," Econometrics and Statistics, Elsevier, vol. 27(C), pages 120-135.
    9. Stefano Favaro & Shui Feng & Fuqing Gao, 2018. "Moderate Deviations for Ewens-Pitman Sampling Models," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 80(2), pages 330-341, August.
    10. Sonia Petrone & Stefano Rizzelli & Judith Rousseau & Catia Scricciolo, 2014. "Empirical Bayes methods in classical and Bayesian inference," METRON, Springer;Sapienza Università di Roma, vol. 72(2), pages 201-215, August.
    11. Emanuele Dolera, 2022. "Asymptotic Efficiency of Point Estimators in Bayesian Predictive Inference," Mathematics, MDPI, vol. 10(7), pages 1-27, April.
    12. Stefano Favaro & Bernardo Nipoti, 2014. "Discussion of “On simulation and properties of the stable law” by L. Devroye and L. James," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 23(3), pages 365-369, August.
    13. Emanuele Dolera & Stefano Favaro, 2021. "A Compound Poisson Perspective of Ewens–Pitman Sampling Model," Mathematics, MDPI, vol. 9(21), pages 1-12, November.
    14. Miller Jeffrey W., 2023. "Consistency of mixture models with a prior on the number of components," Dependence Modeling, De Gruyter, vol. 11(1), pages 1-9, January.
    15. Favaro, Stefano & Panero, Francesca & Rigon, Tommaso, 2021. "Bayesian nonparametric disclosure risk assessment," LSE Research Online Documents on Economics 117305, London School of Economics and Political Science, LSE Library.
    16. Pierpaolo De Blasi & Stefano Favaro & Antonio Lijoi & Ramsés H. Mena & Igor Prünster & Mattteo Ruggiero, 2013. "Are Gibbs-type priors the most natural generalization of the Dirichlet process?," DEM Working Papers Series 054, University of Pavia, Department of Economics and Management.
    17. Stefano Favaro & Antonio Lijoi & Igor Prunster, 2011. "Asymptotics for a Bayesian nonparametric estimator of species richness," Quaderni di Dipartimento 144, University of Pavia, Department of Economics and Quantitative Methods.
    18. Cesari, Oriana & Favaro, Stefano & Nipoti, Bernardo, 2014. "Posterior analysis of rare variants in Gibbs-type species sampling models," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 79-98.
    19. Qianwen Tan & Subhashis Ghosal, 2021. "Bayesian Analysis of Mixed-effect Regression Models Driven by Ordinary Differential Equations," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 3-29, May.
    20. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    21. Jin, Xin & Maheu, John M., 2016. "Bayesian semiparametric modeling of realized covariance matrices," Journal of Econometrics, Elsevier, vol. 192(1), pages 19-39.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:demode:v:7:y:2019:i:1:p:45-52:n:3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.