IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1811.08083.html
   My bibliography  Save this paper

Complete Subset Averaging with Many Instruments

Author

Listed:
  • Seojeong Lee
  • Youngki Shin

Abstract

We propose a two-stage least squares (2SLS) estimator whose first stage is the equal-weighted average over a complete subset with $k$ instruments among $K$ available, which we call the complete subset averaging (CSA) 2SLS. The approximate mean squared error (MSE) is derived as a function of the subset size $k$ by the Nagar (1959) expansion. The subset size is chosen by minimizing the sample counterpart of the approximate MSE. We show that this method achieves the asymptotic optimality among the class of estimators with different subset sizes. To deal with averaging over a growing set of irrelevant instruments, we generalize the approximate MSE to find that the optimal $k$ is larger than otherwise. An extensive simulation experiment shows that the CSA-2SLS estimator outperforms the alternative estimators when instruments are correlated. As an empirical illustration, we estimate the logistic demand function in Berry, Levinsohn, and Pakes (1995) and find the CSA-2SLS estimate is better supported by economic theory than the alternative estimates.

Suggested Citation

  • Seojeong Lee & Youngki Shin, 2018. "Complete Subset Averaging with Many Instruments," Papers 1811.08083, arXiv.org, revised Aug 2020.
  • Handle: RePEc:arx:papers:1811.08083
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1811.08083
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    2. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    3. Tomohiro Ando & Ker-Chau Li, 2014. "A Model-Averaging Approach for High-Dimensional Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 254-265, March.
    4. Donald, Stephen G & Newey, Whitney K, 2001. "Choosing the Number of Instruments," Econometrica, Econometric Society, vol. 69(5), pages 1161-1191, September.
    5. Guido Kuersteiner & Ryo Okui, 2010. "Constructing Optimal Instruments by First-Stage Prediction Averaging," Econometrica, Econometric Society, vol. 78(2), pages 697-718, March.
    6. Victor Chernozhukov & Christian Hansen & Martin Spindler, 2015. "Post-Selection and Post-Regularization Inference in Linear Models with Many Controls and Instruments," American Economic Review, American Economic Association, vol. 105(5), pages 486-490, May.
    7. Clemen, Robert T., 1989. "Combining forecasts: A review and annotated bibliography," International Journal of Forecasting, Elsevier, vol. 5(4), pages 559-583.
    8. Berry, Steven & Levinsohn, James & Pakes, Ariel, 1995. "Automobile Prices in Market Equilibrium," Econometrica, Econometric Society, vol. 63(4), pages 841-890, July.
    9. Newey, Whitney K, 1990. "Efficient Instrumental Variables Estimation of Nonlinear Models," Econometrica, Econometric Society, vol. 58(4), pages 809-837, July.
    10. Elliott, Graham & Gargano, Antonio & Timmermann, Allan, 2015. "Complete subset regressions with large-dimensional sets of predictors," Journal of Economic Dynamics and Control, Elsevier, vol. 54(C), pages 86-110.
    11. Hansen, Bruce E. & Racine, Jeffrey S., 2012. "Jackknife model averaging," Journal of Econometrics, Elsevier, vol. 167(1), pages 38-46.
    12. Elliott, Graham & Gargano, Antonio & Timmermann, Allan, 2013. "Complete subset regressions," Journal of Econometrics, Elsevier, vol. 177(2), pages 357-373.
    13. Jeremy Smith & Kenneth F. Wallis, 2009. "A Simple Explanation of the Forecast Combination Puzzle," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 71(3), pages 331-355, June.
    14. Xinyu Zhang & Dalei Yu & Guohua Zou & Hua Liang, 2016. "Optimal Model Averaging Estimation for Generalized Linear Models and Generalized Linear Mixed-Effects Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1775-1790, October.
    15. Mark W. Watson & James H. Stock, 2004. "Combination forecasts of output growth in a seven-country data set," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 23(6), pages 405-430.
    16. Newey, Whitney K., 1997. "Convergence rates and asymptotic normality for series estimators," Journal of Econometrics, Elsevier, vol. 79(1), pages 147-168, July.
    17. Bruce E. Hansen, 2007. "Least Squares Model Averaging," Econometrica, Econometric Society, vol. 75(4), pages 1175-1189, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lee, Ji Hyung & Shin, Youngki, 2023. "Complete Subset Averaging For Quantile Regressions," Econometric Theory, Cambridge University Press, vol. 39(1), pages 146-188, February.
    2. Chen, Xingyi & Li, Haiqi & Zhang, Jing, 2023. "Complete subset averaging approach for high-dimensional generalized linear models," Economics Letters, Elsevier, vol. 226(C).
    3. Islam, M.S. & Das, Barun K. & Das, Pronob & Rahaman, Md Habibur, 2021. "Techno-economic optimization of a zero emission energy system for a coastal community in Newfoundland, Canada," Energy, Elsevier, vol. 220(C).
    4. Seojeong Lee & Siha Lee & Julius Owusu & Youngki Shin, 2023. "csa2sls: A complete subset approach for many instruments using Stata," Stata Journal, StataCorp LP, vol. 23(4), pages 932-941, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seojeong Lee & Youngki Shin, 2018. "Optimal Estimation with Complete Subsets of Instruments," Department of Economics Working Papers 2018-15, McMaster University.
    2. Lee, Ji Hyung & Shin, Youngki, 2023. "Complete Subset Averaging For Quantile Regressions," Econometric Theory, Cambridge University Press, vol. 39(1), pages 146-188, February.
    3. Chen, Yi-Ting & Liu, Chu-An, 2023. "Model averaging for asymptotically optimal combined forecasts," Journal of Econometrics, Elsevier, vol. 235(2), pages 592-607.
    4. Antoine Mandel & Amir Sani, 2017. "A Machine Learning Approach to the Forecast Combination Puzzle," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01317974, HAL.
    5. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    6. Enrique Moral-Benito, 2015. "Model Averaging In Economics: An Overview," Journal of Economic Surveys, Wiley Blackwell, vol. 29(1), pages 46-75, February.
    7. Steven F. Lehrer & Tian Xie, 2022. "The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success," Management Science, INFORMS, vol. 68(1), pages 189-210, January.
    8. Wei Qian & Craig A. Rolling & Gang Cheng & Yuhong Yang, 2019. "On the Forecast Combination Puzzle," Econometrics, MDPI, vol. 7(3), pages 1-26, September.
    9. Sun, Yuying & Hong, Yongmiao & Wang, Shouyang & Zhang, Xinyu, 2023. "Penalized time-varying model averaging," Journal of Econometrics, Elsevier, vol. 235(2), pages 1355-1377.
    10. Cheng, Xu & Hansen, Bruce E., 2015. "Forecasting with factor-augmented regression: A frequentist model averaging approach," Journal of Econometrics, Elsevier, vol. 186(2), pages 280-293.
    11. Chen, Xingyi & Li, Haiqi & Zhang, Jing, 2023. "Complete subset averaging approach for high-dimensional generalized linear models," Economics Letters, Elsevier, vol. 226(C).
    12. Antoine Mandel & Amir Sani, 2016. "Learning Time-Varying Forecast Combinations," Documents de travail du Centre d'Economie de la Sorbonne 16036, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    13. Xu Cheng & Bruce E. Hansen, 2012. "Forecasting with Factor-Augmented Regression: A Frequentist Model Averaging Approach, Second Version," PIER Working Paper Archive 13-061, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 03 Sep 2013.
    14. Greenaway-McGrevy, Ryan, 2022. "Forecast combination for VARs in large N and T panels," International Journal of Forecasting, Elsevier, vol. 38(1), pages 142-164.
    15. Chunrong Ai & Oliver Linton & Kaiji Motegi & Zheng Zhang, 2021. "A unified framework for efficient estimation of general treatment models," Quantitative Economics, Econometric Society, vol. 12(3), pages 779-816, July.
    16. Haili Zhang & Guohua Zou, 2020. "Cross-Validation Model Averaging for Generalized Functional Linear Model," Econometrics, MDPI, vol. 8(1), pages 1-35, February.
    17. Qingliang Fan & Yaqian Wu, 2020. "Endogenous Treatment Effect Estimation with some Invalid and Irrelevant Instruments," Papers 2006.14998, arXiv.org.
    18. Zhao, Shangwei & Xie, Tian & Ai, Xin & Yang, Guangren & Zhang, Xinyu, 2023. "Correcting sample selection bias with model averaging for consumer demand forecasting," Economic Modelling, Elsevier, vol. 123(C).
    19. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    20. Rongjie Jiang & Liming Wang & Yang Bai, 2021. "Optimal model averaging estimator for semi-functional partially linear models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(2), pages 167-194, February.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1811.08083. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.