IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2103.00557.html
   My bibliography  Save this paper

Algorithmic subsampling under multiway clustering

Author

Listed:
  • Harold D. Chiang
  • Jiatong Li
  • Yuya Sasaki

Abstract

This paper proposes a novel method of algorithmic subsampling (data sketching) for multiway cluster dependent data. We establish a new uniform weak law of large numbers and a new central limit theorem for the multiway algorithmic subsample means. Consequently, we discover an additional advantage of the algorithmic subsampling that it allows for robustness against potential degeneracy, and even non-Gaussian degeneracy, of the asymptotic distribution under multiway clustering. Simulation studies support this novel result, and demonstrate that inference with the algorithmic subsampling entails more accuracy than that without the algorithmic subsampling. Applying these basic asymptotic theories, we derive the consistency and the asymptotic normality for the multiway algorithmic subsampling generalized method of moments estimator and for the multiway algorithmic subsampling M-estimator. We illustrate an application to scanner data.

Suggested Citation

  • Harold D. Chiang & Jiatong Li & Yuya Sasaki, 2021. "Algorithmic subsampling under multiway clustering," Papers 2103.00557, arXiv.org, revised Oct 2022.
  • Handle: RePEc:arx:papers:2103.00557
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2103.00557
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Harold D. Chiang & Kengo Kato & Yukun Ma & Yuya Sasaki, 2022. "Multiway Cluster Robust Double/Debiased Machine Learning," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(3), pages 1046-1056, June.
    2. Djogbenou, Antoine A. & MacKinnon, James G. & Nielsen, Morten Ørregaard, 2019. "Asymptotic theory and wild bootstrap inference with clustered errors," Journal of Econometrics, Elsevier, vol. 212(2), pages 393-412.
    3. Thompson, Samuel B., 2011. "Simple formulas for standard errors that cluster by both firm and time," Journal of Financial Economics, Elsevier, vol. 99(1), pages 1-10, January.
    4. Laurent Davezies & Xavier D'Haultfoeuille & Yannick Guyonvarch, 2018. "Asymptotic results under multiway clustering," Papers 1807.07925, arXiv.org, revised Aug 2018.
    5. Harold D. Chiang & Kengo Kato & Yuya Sasaki, 2023. "Inference for High-Dimensional Exchangeable Arrays," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 1595-1605, July.
    6. Serena Ng, 2017. "Opportunities and Challenges: Lessons from Analyzing Terabytes of Scanner Data," NBER Working Papers 23673, National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David Roodman & James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2019. "Fast and wild: Bootstrap inference in Stata using boottest," Stata Journal, StataCorp LP, vol. 19(1), pages 4-60, March.
    2. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Cluster-robust inference: A guide to empirical practice," Journal of Econometrics, Elsevier, vol. 232(2), pages 272-299.
    3. James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2021. "Wild Bootstrap and Asymptotic Inference With Multiway Clustering," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(2), pages 505-519, March.
    4. James G. MacKinnon & Matthew D. Webb, 2020. "When and How to Deal with Clustered Errors in Regression Models," Working Paper 1421, Economics Department, Queen's University.
    5. James G. MacKinnon, 2019. "How cluster‐robust inference is changing applied econometrics," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 52(3), pages 851-881, August.
    6. Bruno Ferman, 2023. "Inference in difference‐in‐differences: How much should we trust in independent clusters?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(3), pages 358-369, April.
    7. Harold D Chiang & Yukun Ma & Joel Rodrigue & Yuya Sasaki, 2021. "Dyadic double/debiased machine learning for analyzing determinants of free trade agreements," Papers 2110.04365, arXiv.org, revised Dec 2022.
    8. Matthew D. Webb, 2023. "Reworking wild bootstrap‐based inference for clustered errors," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 56(3), pages 839-858, August.
    9. Harold D. Chiang & Kengo Kato & Yuya Sasaki, 2023. "Inference for High-Dimensional Exchangeable Arrays," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(543), pages 1595-1605, July.
    10. Harold D Chiang & Bruce E Hansen & Yuya Sasaki, 2022. "Standard errors for two-way clustering with serially correlated time effects," Papers 2201.11304, arXiv.org, revised Dec 2023.
    11. Harold D Chiang & Yukitoshi Matsushita & Taisuke Otsu, 2021. "Multiway empirical likelihood," STICERD - Econometrics Paper Series 617, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    12. Greg Hebb, 2021. "On the performance of Bank-managed mutual funds: Canadian evidence," Journal of Economics and Finance, Springer;Academy of Economics and Finance, vol. 45(1), pages 22-48, January.
    13. Caginalp, Gunduz & DeSantis, Mark, 2017. "Does price efficiency increase with trading volume? Evidence of nonlinearity and power laws in ETFs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 467(C), pages 436-452.
    14. Yue Zhao & Difang Wan, 2018. "Institutional high frequency trading and price discovery: Evidence from an emerging commodity futures market," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 38(2), pages 243-270, February.
    15. Douglas Cumming & Lars Hornuf & Moein Karami & Denis Schweizer, 2023. "Disentangling Crowdfunding from Fraudfunding," Journal of Business Ethics, Springer, vol. 182(4), pages 1103-1128, February.
    16. Laurent Davezies & Xavier D'Haultfoeuille & Yannick Guyonvarch, 2019. "Empirical Process Results for Exchangeable Arrays," Papers 1906.11293, arXiv.org, revised May 2020.
    17. Boon, L.N. & Brière, M. & Rigot, S., 2018. "Regulation and pension fund risk-taking," Journal of International Money and Finance, Elsevier, vol. 84(C), pages 23-41.
    18. Andrea Orame, 2020. "The role of bank supply in the Italian credit market: evidence from a new regional survey," Temi di discussione (Economic working papers) 1279, Bank of Italy, Economic Research and International Relations Area.
    19. Chue, Timothy K. & Gul, Ferdinand A. & Mian, G. Mujtaba, 2019. "Aggregate investor sentiment and stock return synchronicity," Journal of Banking & Finance, Elsevier, vol. 108(C).
    20. Rungmaitree, Pattamon & Boateng, Agyenim & Ahiabor, Frederick & Lu, Qinye, 2022. "Political risk, hedge fund strategies, and returns: Evidence from G7 countries," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 81(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2103.00557. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.