IDEAS home Printed from https://ideas.repec.org/p/iza/izadps/dp16594.html
   My bibliography  Save this paper

Consistent Estimation of Panel Data Sample Selection Models

Author

Listed:
  • Baltagi, Badi H.

    (Syracuse University)

  • Jimenez-Martin, Sergi

    (Universitat Pompeu Fabra)

  • Labeaga, José M.

    (UNED)

  • al Sadoon, Majid

    (Durham University Business School)

Abstract

The properties of classical panel data estimators including fixed effect, first-differences, random effects, and generalized method of moments-instrumental variables estimators in both static as well as dynamic panel data models are investigated under sample selection. The correlation of the unobserved errors is shown not to be sufficient for the inconsistency of these estimators. A necessary condition for this to arise is the presence of common (and/or non-independent) non-deterministic covariates in the selection and outcome equations. When both equations do not have covariates in common and independent of each other, the fixed effects, and random effects estimators in static models with exogenous covariates are consistent. Furthermore, the first-differenced generalized method of moments estimator uncorrected for sample selection as well as the instrumental variables estimator uncorrected for sample selection are both consistent for autoregressive models even with endogenous covariates. The same results hold when both equations have no covariates in common but are correlated once we account for such correlation. Under the same circumstances, the system generalized method of moments estimator adding more moments from the levels equation has moderate bias. Alternatively, when both equations have common covariates the appropriate correction method is suggested. Serial correlation of the errors being a key determinant for that choice. The finite sample properties of the proposed estimators are evaluated using a Monte Carlo study. Two empirical illustrations are provided.

Suggested Citation

  • Baltagi, Badi H. & Jimenez-Martin, Sergi & Labeaga, José M. & al Sadoon, Majid, 2023. "Consistent Estimation of Panel Data Sample Selection Models," IZA Discussion Papers 16594, Institute of Labor Economics (IZA).
  • Handle: RePEc:iza:izadps:dp16594
    as

    Download full text from publisher

    File URL: https://docs.iza.org/dp16594.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Hansen, Lars Peter, 1982. "Large Sample Properties of Generalized Method of Moments Estimators," Econometrica, Econometric Society, vol. 50(4), pages 1029-1054, July.
    2. Powell, James L, 1986. "Symmetrically Trimmed Least Squares Estimation for Tobit Models," Econometrica, Econometric Society, vol. 54(6), pages 1435-1460, November.
    3. Gayle, George-Levi & Viauroux, Christelle, 2007. "Root-N consistent semiparametric estimators of a dynamic panel-sample-selection model," Journal of Econometrics, Elsevier, vol. 141(1), pages 179-212, November.
    4. Jorge González-Chapela, 2007. "On the Price of Recreation Goods as a Determinant of Male Labor Supply," Journal of Labor Economics, University of Chicago Press, vol. 25(4), pages 795-824.
    5. Holtz-Eakin, Douglas & Newey, Whitney & Rosen, Harvey S, 1988. "Estimating Vector Autoregressions with Panel Data," Econometrica, Econometric Society, vol. 56(6), pages 1371-1395, November.
    6. Charlier, Erwin & Melenberg, Bertrand & van Soest, Arthur, 2001. "An analysis of housing expenditure using semiparametric models and panel data," Journal of Econometrics, Elsevier, vol. 101(1), pages 71-107, March.
    7. Terence C. Cheng & Pravin K. Trivedi, 2015. "Attrition Bias in Panel Data: A Sheep in Wolf's Clothing? A Case Study Based on the Mabel Survey," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1101-1117, September.
    8. Wooldridge, Jeffrey M., 1995. "Selection corrections for panel data models under conditional mean independence assumptions," Journal of Econometrics, Elsevier, vol. 68(1), pages 115-132, July.
    9. Ekaterini Kyriazidou, 1997. "Estimation of a Panel Data Sample Selection Model," Econometrica, Econometric Society, vol. 65(6), pages 1335-1364, November.
    10. David Roodman, 2009. "A Note on the Theme of Too Many Instruments," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 71(1), pages 135-158, February.
    11. Semykina, Anastasia & Wooldridge, Jeffrey M., 2010. "Estimating panel data models in the presence of endogeneity and selection," Journal of Econometrics, Elsevier, vol. 157(2), pages 375-380, August.
    12. Labeaga, Jose M., 1999. "A double-hurdle rational addiction model with heterogeneity: Estimating the demand for tobacco," Journal of Econometrics, Elsevier, vol. 93(1), pages 49-72, November.
    13. Polachek, Solomon W., 2008. "Earnings Over the Life Cycle: The Mincer Earnings Function and Its Applications," Foundations and Trends(R) in Microeconomics, now publishers, vol. 4(3), pages 165-272, April.
    14. Ekaterini Kyriazidou, 2001. "Estimation of Dynamic Panel Data Sample Selection Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 68(3), pages 543-572.
    15. Manuel Arellano & Olympia Bover & José M. Labeaga, 1997. "Authoregressive Models with Sample Selectivity for Panel Data," Working Papers wp1997_9706, CEMFI.
    16. Andrew M. Jones & José M. Labeaga, 2003. "Individual heterogeneity and censoring in panel data estimates of tobacco expenditure," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 18(2), pages 157-177.
    17. Verbeek, Marno & Nijman, Theo, 1992. "Testing for Selectivity Bias in Panel Data Models," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 33(3), pages 681-703, August.
    18. Becker, Gary S & Grossman, Michael & Murphy, Kevin M, 1994. "An Empirical Analysis of Cigarette Addiction," American Economic Review, American Economic Association, vol. 84(3), pages 396-418, June.
    19. Gary Chamberlain, 1980. "Analysis of Covariance with Qualitative Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 47(1), pages 225-238.
    20. Mark B. Stewart, 2007. "The interrelated dynamics of unemployment and low-wage employment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(3), pages 511-531.
    21. Hung-Pin Lai & Wen-Jen Tsay, 2018. "Maximum simulated likelihood estimation of the panel sample selection model," Econometric Reviews, Taylor & Francis Journals, vol. 37(7), pages 744-759, August.
    22. repec:adr:anecst:y:1999:i:55-56:p:06 is not listed on IDEAS
    23. Hayakawa, Kazuhiko, 2007. "Small sample bias properties of the system GMM estimator in dynamic panel data models," Economics Letters, Elsevier, vol. 95(1), pages 32-38, April.
    24. Arellano, Manuel & Bover, Olympia, 1995. "Another look at the instrumental variable estimation of error-components models," Journal of Econometrics, Elsevier, vol. 68(1), pages 29-51, July.
    25. Joseph V. Terza, 2016. "Simpler standard errors for two-stage optimization estimators estimation in normal linear models," Stata Journal, StataCorp LP, vol. 16(2), pages 368-385, June.
    26. Wladimir Raymond & Pierre Mohnen & Franz Palm & Sybrand Schim van der Loeff, 2007. "The Behavior of the Maximum Likelihood Estimator of Dynamic Panel Data Sample Selection Models," CESifo Working Paper Series 1992, CESifo.
    27. Jeffrey M Wooldridge, 2010. "Econometric Analysis of Cross Section and Panel Data," MIT Press Books, The MIT Press, edition 2, volume 1, number 0262232588, December.
    28. Anderson, T. W. & Hsiao, Cheng, 1982. "Formulation and estimation of dynamic models using panel data," Journal of Econometrics, Elsevier, vol. 18(1), pages 47-82, January.
    29. Olsen, Randall J, 1980. "A Least Squares Correction for Selectivity Bias," Econometrica, Econometric Society, vol. 48(7), pages 1815-1820, November.
    30. Hsiao,Cheng & Pesaran,M. Hashem & Lahiri,Kajal & Lee,Lung Fei (ed.), 1999. "Analysis of Panels and Limited Dependent Variable Models," Cambridge Books, Cambridge University Press, number 9780521631693.
    31. Blundell, Richard & Bond, Stephen, 1998. "Initial conditions and moment restrictions in dynamic panel data models," Journal of Econometrics, Elsevier, vol. 87(1), pages 115-143, August.
    32. Anastasia Semykina & Jeffrey M. Wooldridge, 2013. "Estimation of dynamic panel data models with sample selection," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 28(1), pages 47-61, January.
    33. Sasaki, Yuya, 2015. "Heterogeneity and selection in dynamic panel data," Journal of Econometrics, Elsevier, vol. 188(1), pages 236-249.
    34. O. Ashenfelter & D. Card (ed.), 1999. "Handbook of Labor Economics," Handbook of Labor Economics, Elsevier, edition 1, volume 3, number 3.
    35. Olympia Bover & Manuel Arellano, 1997. "Estimating limited dependent variable models from panel data," Investigaciones Economicas, Fundación SEPI, vol. 21(2), pages 141-166, May.
    36. Anastasia Semykina & Jeffrey M. Wooldridge, 2018. "Binary response panel data models with sample selection and self‐selection," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(2), pages 179-197, March.
    37. David Roodman, 2006. "How to Do xtabond2," North American Stata Users' Group Meetings 2006 8, Stata Users Group.
    38. Luojia Hu, 2002. "Estimation of a Censored Dynamic Panel Data Model," Econometrica, Econometric Society, vol. 70(6), pages 2499-2517, November.
    39. Badi H. Baltagi, 2021. "Econometric Analysis of Panel Data," Springer Texts in Business and Economics, Springer, edition 6, number 978-3-030-53953-5, August.
    40. Mundlak, Yair, 1978. "On the Pooling of Time Series and Cross Section Data," Econometrica, Econometric Society, vol. 46(1), pages 69-85, January.
    41. Nickell, Stephen J, 1981. "Biases in Dynamic Models with Fixed Effects," Econometrica, Econometric Society, vol. 49(6), pages 1417-1426, November.
    42. Colm Harmon & Hessel Oosterbeek & Ian Walker, 2003. "The Returns to Education: Microeconomics," Journal of Economic Surveys, Wiley Blackwell, vol. 17(2), pages 115-156, April.
    43. Becker, Gary S & Murphy, Kevin M, 1988. "A Theory of Rational Addiction," Journal of Political Economy, University of Chicago Press, vol. 96(4), pages 675-700, August.
    44. María Engracia Rochina-Barrachina, 1999. "A New Estimator for Panel Data Sample Selection Models," Annals of Economics and Statistics, GENES, issue 55-56, pages 153-181.
    45. Manuel Arellano & Stephen Bond, 1991. "Some Tests of Specification for Panel Data: Monte Carlo Evidence and an Application to Employment Equations," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 58(2), pages 277-297.
    46. Windmeijer, Frank, 2005. "A finite sample correction for the variance of linear efficient two-step GMM estimators," Journal of Econometrics, Elsevier, vol. 126(1), pages 25-51, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Majid M. Al-Sadoon & Sergi Jiménez-Martín & Jose M. Labeaga, 2019. "Simple methods for consistent estimation of dynamic panel data sample selection models," Economics Working Papers 1631, Department of Economics and Business, Universitat Pompeu Fabra.
    2. Sergi Jiménez-Martín & José María Labeaga, 2016. "Monte Carlo evidence on the estimation of AR(1) panel data sample selection models," Working Papers 2016-01, FEDEA.
    3. Giulia Bettin & Riccardo Lucchetti & Claudia Pigini, 2016. "State dependence and unobserved heterogeneity in a double hurdle model for remittances: evidence from immigrants to Germany," Mo.Fi.R. Working Papers 127, Money and Finance Research group (Mo.Fi.R.) - Univ. Politecnica Marche - Dept. Economic and Social Sciences.
    4. Youssef, Ahmed & Abonazel, Mohamed R., 2015. "Alternative GMM Estimators for First-order Autoregressive Panel Model: An Improving Efficiency Approach," MPRA Paper 68674, University Library of Munich, Germany.
    5. Stephen O'Neill & Kevin Hanrahan, 2016. "The capitalization of coupled and decoupled CAP payments into land rental rates," Agricultural Economics, International Association of Agricultural Economists, vol. 47(3), pages 285-294, May.
    6. Emir Malikov & Diego A. Restrepo-Tobón & Subal C. Kumbhakar, 2018. "Heterogeneous credit union production technologies with endogenous switching and correlated effects," Econometric Reviews, Taylor & Francis Journals, vol. 37(10), pages 1095-1119, November.
    7. Bakhat, Mohcine & Labandeira, Xavier & Labeaga, José M. & López-Otero, Xiral, 2017. "Elasticities of transport fuels at times of economic crisis: An empirical analysis for Spain," Energy Economics, Elsevier, vol. 68(S1), pages 66-80.
    8. Sebastian Kripfganz & Claudia Schwarz, 2019. "Estimation of linear dynamic panel data models with time‐invariant regressors," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(4), pages 526-546, June.
    9. Sigmund, Michael & Ferstl, Robert, 2021. "Panel vector autoregression in R with the package panelvar," The Quarterly Review of Economics and Finance, Elsevier, vol. 80(C), pages 693-720.
    10. Seidu, Ayuba & Onel, Gulcan & Moss, Charles Britt, 2018. "Impact of International Remittance on Out-Farm Labor Migration in Developing Countries: A Dynamic Panel Data Analysis," 2018 Annual Meeting, February 2-6, 2018, Jacksonville, Florida 266531, Southern Agricultural Economics Association.
    11. Fernández-Val, Iván & Vella, Francis, 2011. "Bias corrections for two-step fixed effects panel data estimators," Journal of Econometrics, Elsevier, vol. 163(2), pages 144-162, August.
    12. Labeaga, Jose M., 1999. "A double-hurdle rational addiction model with heterogeneity: Estimating the demand for tobacco," Journal of Econometrics, Elsevier, vol. 93(1), pages 49-72, November.
    13. Martikainen, Emmi & Schmiedel, Heiko & Takalo, Tuomas, 2015. "Convergence of European retail payments," Journal of Banking & Finance, Elsevier, vol. 50(C), pages 81-91.
    14. Piccoli, Luca & Tiezzi, Silvia, 2021. "Rational addiction and time-consistency: An empirical test," Journal of Health Economics, Elsevier, vol. 80(C).
    15. Semykina, Anastasia & Wooldridge, Jeffrey M., 2010. "Estimating panel data models in the presence of endogeneity and selection," Journal of Econometrics, Elsevier, vol. 157(2), pages 375-380, August.
    16. David Roodman, 2009. "How to do xtabond2: An introduction to difference and system GMM in Stata," Stata Journal, StataCorp LP, vol. 9(1), pages 86-136, March.
    17. Scott, K. Rebecca, 2015. "Demand and price uncertainty: Rational habits in international gasoline demand," Energy, Elsevier, vol. 79(C), pages 40-49.
    18. Monica Schuster & Miet Maertens, 2013. "8 Private Food Standards and Firm-Level Trade Effects: A Dynamic Analysis of the Peruvian Asparagus Export Sector," Frontiers of Economics and Globalization, in: Nontariff Measures with Market Imperfections: Trade and Welfare Implications, pages 187-213, Emerald Group Publishing Limited.
    19. Hayakawa, Kazuhiko, 2019. "Alternative over-identifying restriction test in the GMM estimation of panel data models," Econometrics and Statistics, Elsevier, vol. 10(C), pages 71-95.
    20. Eric Akobeng, 2017. "Gross Capital Formation, Institutions and Poverty in Sub-Saharan Africa," Journal of Economic Policy Reform, Taylor & Francis Journals, vol. 20(2), pages 136-164, April.

    More about this item

    Keywords

    panel data; sample selection; generalized method of moments; fixed and random effects; differenced estimator;
    All these keywords.

    JEL classification:

    • J52 - Labor and Demographic Economics - - Labor-Management Relations, Trade Unions, and Collective Bargaining - - - Dispute Resolution: Strikes, Arbitration, and Mediation
    • C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models
    • C24 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Truncated and Censored Models; Switching Regression Models; Threshold Regression Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iza:izadps:dp16594. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Holger Hinte (email available below). General contact details of provider: https://edirc.repec.org/data/izaaade.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.