IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v35y2020i2d10.1007_s00180-019-00899-7.html
   My bibliography  Save this article

Random sampling of contingency tables via probabilistic divide-and-conquer

Author

Listed:
  • Stephen DeSalvo

    (UCLA Department of Mathematics)

  • James Zhao

    (USC Department of Mathematics)

Abstract

We present a new approach for random sampling of contingency tables of any size and constraints based on a recently introduced probabilistic divide-and-conquer (PDC) technique. Our first application is a recursive PDC: it samples the least significant bit of each entry in the table, motivated by the fact that the bits of a geometric random variable are independent. The second application is via PDC deterministic second half, where one divides the sample space into two pieces, one of which is deterministic conditional on the other; this approach is highlighted via an exact sampling algorithm in the $$2\times n$$2×n case. Finally, we also present a generalization to the sampling algorithm where each entry of the table has a specified marginal distribution.

Suggested Citation

  • Stephen DeSalvo & James Zhao, 2020. "Random sampling of contingency tables via probabilistic divide-and-conquer," Computational Statistics, Springer, vol. 35(2), pages 837-869, June.
  • Handle: RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00899-7
    DOI: 10.1007/s00180-019-00899-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-019-00899-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-019-00899-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yuguo Chen & Persi Diaconis & Susan P. Holmes & Jun S. Liu, 2005. "Sequential Monte Carlo Methods for Statistical Analysis of Tables," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 109-120, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dyer, Martin & Greenhill, Catherine & Kleer, Pieter & Ross, James & Stougie, Leen, 2021. "Sampling hypergraphs with given degrees," Other publications TiSEM 2d323767-8066-4c28-92cc-b, Tilburg University, School of Economics and Management.
    2. Zhang, Linjun & Small, Michael & Judd, Kevin, 2015. "Exactly scale-free scale-free networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 433(C), pages 182-197.
    3. Ian Dinwoodie & Kruti Pandya, 2015. "Exact tests for singular network data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(4), pages 687-706, August.
    4. Jing Xi & Ruriko Yoshida & David Haws, 2013. "Estimating the number of zero-one multi-way tables via sequential importance sampling," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(4), pages 763-783, August.
    5. Clemens Draxler, 2018. "Bayesian conditional inference for Rasch models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(2), pages 245-262, April.
    6. Ogawa, Mitsunori & Takemura, Akimichi, 2012. "Markov bases for typical block effect models of two-way contingency tables," Journal of Multivariate Analysis, Elsevier, vol. 112(C), pages 219-229.
    7. Sanda Micula, 2015. "Statistical Computer Simulations And Monte Carlo Methods," Romanian Economic Business Review, Romanian-American University, vol. 9(2), pages 384-394, December.
    8. Bartolucci, Francesco & Pigini, Claudia & Valentini, Francesco, 2021. "MCMC Conditional Maximum Likelihood for the two-way fixed-effects logit," MPRA Paper 110034, University Library of Munich, Germany.
    9. George Fishman, 2014. "Counting subsets of contingency tables," Computational Statistics, Springer, vol. 29(1), pages 159-187, February.
    10. Yuguo Chen & Dylan Small, 2005. "Exact tests for the rasch model via sequential importance sampling," Psychometrika, Springer;The Psychometric Society, vol. 70(1), pages 11-30, March.
    11. David Kahle & Ruriko Yoshida & Luis Garcia-Puente, 2018. "Hybrid schemes for exact conditional inference in discrete exponential families," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(5), pages 983-1011, October.
    12. Radislav Vaisman & Zdravko Botev & Ad Ridder, 2013. "Sequential Monte Carlo for Counting Vertex Covers in General Graphs," Tinbergen Institute Discussion Papers 13-122/III, Tinbergen Institute.
    13. Radislav Vaisman & Dirk P. Kroese, 2017. "Stochastic Enumeration Method for Counting Trees," Methodology and Computing in Applied Probability, Springer, vol. 19(1), pages 31-73, March.
    14. Rivest, Louis-Paul, 2021. "Limiting properties of an equiprobable sampling scheme for 0–1 matrices," Statistics & Probability Letters, Elsevier, vol. 172(C).
    15. repec:jss:jstsof:20:i04 is not listed on IDEAS
    16. Verhelst, Norman & Hatzinger, Reinhold & Mair, Patrick, 2007. "The Rasch Sampler," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 20(i04).
    17. repec:jss:jstsof:17:i07 is not listed on IDEAS
    18. Norman Verhelst, 2008. "An Efficient MCMC Algorithm to Sample Binary Matrices with Fixed Marginals," Psychometrika, Springer;The Psychometric Society, vol. 73(4), pages 705-728, December.
    19. Radislav Vaisman & Ofer Strichman & Ilya Gertsbakh, 2015. "Model Counting of Monotone Conjunctive Normal Form Formulas with Spectra," INFORMS Journal on Computing, INFORMS, vol. 27(2), pages 406-415, May.
    20. Drew Creal, 2012. "A Survey of Sequential Monte Carlo Methods for Economics and Finance," Econometric Reviews, Taylor & Francis Journals, vol. 31(3), pages 245-296.
    21. Caffo, Brian, 2006. "Exact Hypothesis Tests for Log-linear Models with exactLoglinTest," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 17(i07).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00899-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.