IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/38698.html
   My bibliography  Save this paper

Endogeneity in ultrahigh dimension

Author

Listed:
  • Fan, Jianqing
  • Liao, Yuan

Abstract

Most papers on high-dimensional statistics are based on the assumption that none of the regressors are correlated with the regression error, namely, they are exogenous. Yet, endogeneity arises easily in high-dimensional regression due to a large pool of regressors and this causes the inconsistency of the penalized least-squares methods and possible false scientic discoveries. A necessary condition for model selection of a very general class of penalized regression methods is given, which allows us to prove formally the inconsistency claim. To cope with the possible endogeneity, we construct a novel penalized focussed generalized method of moments (FGMM) criterion function and oer a new optimization algorithm. The FGMM is not a smooth function. To establish its asymptotic properties, we rst study the model selection consistency and an oracle property for a general class of penalized regression methods. These results are then used to show that the FGMM possesses an oracle property even in the presence of endogenous predictors, and that the solution is also near global minimum under the over-identication assumption. Finally, we also show how the semi-parametric efficiency of estimation can be achieved via a two-step approach.

Suggested Citation

  • Fan, Jianqing & Liao, Yuan, 2012. "Endogeneity in ultrahigh dimension," MPRA Paper 38698, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:38698
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/38698/1/MPRA_paper_38698.pdf
    File Function: original version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hansen, Lars Peter, 1982. "Large Sample Properties of Generalized Method of Moments Estimators," Econometrica, Econometric Society, vol. 50(4), pages 1029-1054, July.
    2. Jelena Bradic & Jianqing Fan & Weiwei Wang, 2011. "Penalized composite quasi‐likelihood for ultrahigh dimensional variable selection," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 325-349, June.
    3. Donald W. K. Andrews, 1999. "Consistent Moment Selection Procedures for Generalized Method of Moments Estimation," Econometrica, Econometric Society, vol. 67(3), pages 543-564, May.
    4. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    5. Yuichi Kitamura & Gautam Tripathi & Hyungtaik Ahn, 2004. "Empirical Likelihood-Based Inference in Conditional Moment Restriction Models," Econometrica, Econometric Society, vol. 72(6), pages 1667-1714, November.
    6. Andrews, Donald W. K. & Lu, Biao, 2001. "Consistent model and moment selection procedures for GMM estimation with application to dynamic panel data models," Journal of Econometrics, Elsevier, vol. 101(1), pages 123-164, March.
    7. Severini, Thomas A. & Tripathi, Gautam, 2001. "A simplified approach to computing efficiency bounds in semiparametric models," Journal of Econometrics, Elsevier, vol. 102(1), pages 23-66, May.
    8. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    9. Eric Gautier & Alexandre Tsybakov, 2011. "High-Dimensional Instrumental Variables Regression and Confidence Sets," Working Papers 2011-13, Center for Research in Economics and Statistics.
    10. P. Bühlmann & M. Kalisch & M. H. Maathuis, 2010. "Variable selection in high-dimensional linear models: partially faithful distributions and the pc -simple algorithm," Biometrika, Biometrika Trust, vol. 97(2), pages 261-278.
    11. Liao, Zhipeng, 2013. "Adaptive Gmm Shrinkage Estimation With Consistent Moment Selection," Econometric Theory, Cambridge University Press, vol. 29(5), pages 857-904, October.
    12. Donald, Stephen G. & Imbens, Guido W. & Newey, Whitney K., 2003. "Empirical likelihood estimation and consistent tests with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 117(1), pages 55-93, November.
    13. Chamberlain, Gary, 1987. "Asymptotic efficiency in estimation with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 34(3), pages 305-334, March.
    14. Caner, Mehmet, 2009. "Lasso-Type Gmm Estimator," Econometric Theory, Cambridge University Press, vol. 25(1), pages 270-290, February.
    15. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    16. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    17. Horowitz, Joel L, 1992. "A Smoothed Maximum Score Estimator for the Binary Response Model," Econometrica, Econometric Society, vol. 60(3), pages 505-531, May.
    18. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chang, Jinyuan & Chen, Song Xi & Chen, Xiaohong, 2015. "High dimensional generalized empirical likelihood for moment restrictions with dependent data," Journal of Econometrics, Elsevier, vol. 185(1), pages 283-304.
    2. Zhu, Ying, 2015. "Sparse Linear Models and l1−Regularized 2SLS with High-Dimensional Endogenous Regressors and Instruments," MPRA Paper 81217, University Library of Munich, Germany.
    3. Mehmet Caner & Xu Han & Yoonseok Lee, 2018. "Adaptive Elastic Net GMM Estimation With Many Invalid Moment Conditions: Simultaneous Model and Moment Selection," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(1), pages 24-46, January.
    4. Lu, Xun & Su, Liangjun, 2016. "Shrinkage estimation of dynamic panel data models with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 190(1), pages 148-175.
    5. Achim Ahrens & Arnab Bhattacharjee, 2015. "Two-Step Lasso Estimation of the Spatial Weights Matrix," Econometrics, MDPI, vol. 3(1), pages 1-28, March.
    6. Ben Gillen & Erik Snowberg & Leeat Yariv, 2015. "Experimenting with Measurement Error: Techniques with Applications to the Caltech Cohort Study," NBER Working Papers 21517, National Bureau of Economic Research, Inc.
    7. Task Force Members Include: Lilli Japec & Frauke Kreuter & Marcus Berg & Paul Biemer & Paul Decker & Cliff Lampe & Julia Lane & Cathy O'Neil & Abe Usher, "undated". "AAPOR Report on Big Data," Mathematica Policy Research Reports 4eb9b798fd5b42a8b53a9249c, Mathematica Policy Research.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mehmet Caner & Xu Han & Yoonseok Lee, 2018. "Adaptive Elastic Net GMM Estimation With Many Invalid Moment Conditions: Simultaneous Model and Moment Selection," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(1), pages 24-46, January.
    2. Xu Cheng & Zhipeng Liao, 2012. "Select the Valid and Relevant Moments: A One-Step Procedure for GMM with Many Moments," PIER Working Paper Archive 12-045, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    3. Aman Ullah & Huansha Wang, 2013. "Parametric and Nonparametric Frequentist Model Selection and Model Averaging," Econometrics, MDPI, vol. 1(2), pages 1-23, September.
    4. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    5. Qingliang Fan & Yaqian Wu, 2020. "Endogenous Treatment Effect Estimation with some Invalid and Irrelevant Instruments," Papers 2006.14998, arXiv.org.
    6. Alena Skolkova, 2023. "Instrumental Variable Estimation with Many Instruments Using Elastic-Net IV," CERGE-EI Working Papers wp759, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
    7. Gerda Claeskens, 2012. "Focused estimation and model averaging with penalization methods: an overview," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 66(3), pages 272-287, August.
    8. Ando, Tomohiro & Sueishi, Naoya, 2019. "Regularization parameter selection for penalized empirical likelihood estimator," Economics Letters, Elsevier, vol. 178(C), pages 1-4.
    9. Cheng, Xu & Liao, Zhipeng, 2015. "Select the valid and relevant moments: An information-based LASSO for GMM with many moments," Journal of Econometrics, Elsevier, vol. 186(2), pages 443-464.
    10. Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.
    11. Wang, Christina Dan & Chen, Zhao & Lian, Yimin & Chen, Min, 2022. "Asset selection based on high frequency Sharpe ratio," Journal of Econometrics, Elsevier, vol. 227(1), pages 168-188.
    12. Peter Bühlmann & Jacopo Mandozzi, 2014. "High-dimensional variable screening and bias in subsequent inference, with an empirical comparison," Computational Statistics, Springer, vol. 29(3), pages 407-430, June.
    13. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    14. Zhang, Ting & Wang, Lei, 2020. "Smoothed empirical likelihood inference and variable selection for quantile regression with nonignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    15. Jingxuan Luo & Lili Yue & Gaorong Li, 2023. "Overview of High-Dimensional Measurement Error Regression Models," Mathematics, MDPI, vol. 11(14), pages 1-22, July.
    16. Antoine, Bertille & Bonnal, Helene & Renault, Eric, 2007. "On the efficient use of the informational content of estimating equations: Implied probabilities and Euclidean empirical likelihood," Journal of Econometrics, Elsevier, vol. 138(2), pages 461-487, June.
    17. Tan, Xin Lu, 2019. "Optimal estimation of slope vector in high-dimensional linear transformation models," Journal of Multivariate Analysis, Elsevier, vol. 169(C), pages 179-204.
    18. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    19. Lewbel, Arthur & Choi, Jin Young & Zhou, Zhuzhu, 2023. "Over-identified Doubly Robust identification and estimation," Journal of Econometrics, Elsevier, vol. 235(1), pages 25-42.
    20. Chen, Shi & Härdle, Wolfgang Karl & López Cabrera, Brenda, 2018. "Regularization Approach for Network Modeling of German Energy Market," IRTG 1792 Discussion Papers 2018-017, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".

    More about this item

    Keywords

    Focused GMM; Sparsity recovery; Endogenous variables; Oracle property; Conditional moment restriction; Estimating equation; Over identi cation; Global minimization; Semi-parametric efficiency;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C01 - Mathematical and Quantitative Methods - - General - - - Econometrics

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:38698. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.