IDEAS home Printed from https://ideas.repec.org/a/bla/obuest/v83y2021i3p742-769.html
   My bibliography  Save this article

Assessing Sampling Error in Pseudo‐Panel Models

Author

Listed:
  • Rumman Khan

Abstract

While pseudo‐panels are useful when only repeated cross‐section data are available, estimates are likely to be attenuated and suffer from sampling error if cell sizes (number of individuals grouped together in a cohort) are too few. However, there is no consensus on how large cell size needs to be, with recommendations ranging from 100 to several thousands. This is due to sampling error being affected by both cell size and three important types of variation in the cohort data (across and within cohorts and over time). We combine these into a single metric, called CAWAR, and demonstrate its relationship to sampling error using Monte Carlo simulations and an empirical application. We produce recommended values for CAWAR beyond which sampling error bias is minimal and from these one can easily calculate the required cell size.

Suggested Citation

  • Rumman Khan, 2021. "Assessing Sampling Error in Pseudo‐Panel Models," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 83(3), pages 742-769, June.
  • Handle: RePEc:bla:obuest:v:83:y:2021:i:3:p:742-769
    DOI: 10.1111/obes.12416
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/obes.12416
    Download Restriction: no

    File URL: https://libkey.io/10.1111/obes.12416?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Imai, Katsushi S. & Annim, Samuel Kobina & Kulkarni, Veena S. & Gaiha, Raghav, 2014. "Women’s Empowerment and Prevalence of Stunted and Underweight Children in Rural India," World Development, Elsevier, vol. 62(C), pages 88-105.
    2. Verbeek, Marno & Vella, Francis, 2005. "Estimating dynamic models from repeated cross-sections," Journal of Econometrics, Elsevier, vol. 127(1), pages 83-102, July.
    3. Verbeek, Marno & Nijman, Theo, 1992. "Can Cohort Data Be Treated as Genuine Panel Data?," Empirical Economics, Springer, vol. 17(1), pages 9-23.
    4. Jose Cuesta & Hugo Ñopo & Georgina Pizzolitto, 2011. "Using Pseudo‐Panels To Measure Income Mobility In Latin America," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 57(2), pages 224-246, June.
    5. Moffitt, Robert, 1993. "Identification and estimation of dynamic models with a time series of repeated cross-sections," Journal of Econometrics, Elsevier, vol. 59(1-2), pages 99-123, September.
    6. Mª Dolores Collado, 1998. "Estimating binary choice models from cohort data," Investigaciones Economicas, Fundación SEPI, vol. 22(2), pages 259-276, May.
    7. Badi H. Baltagi & Georges Bresson & Jean‐Michel Etienne, 2015. "Hedonic Housing Prices in Paris: An Unbalanced Spatial Lag Pseudo‐Panel Model with Nested Random Effects," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(3), pages 509-528, April.
    8. Alessie, Rob & Devereux, Michael P. & Weber, Guglielmo, 1997. "Intertemporal consumption, durables and liquidity constraints: A cohort analysis," European Economic Review, Elsevier, vol. 41(1), pages 37-59, January.
    9. Francisca Antman & David J. McKenzie, 2007. "Earnings Mobility and Measurement Error: A Pseudo-Panel Approach," Economic Development and Cultural Change, University of Chicago Press, vol. 56(1), pages 125-161, October.
    10. Paul J. Devereux, 2007. "Small-sample bias in synthetic cohort models of labor supply," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(4), pages 839-848.
    11. Browning, Martin & Deaton, Angus & Irish, Margaret, 1985. "A Profitable Approach to Labor Supply and Commodity Demands over the Life-Cycle," Econometrica, Econometric Society, vol. 53(3), pages 503-543, May.
    12. Fulford, Scott, 2014. "Returns to Education in India," World Development, Elsevier, vol. 59(C), pages 434-450.
    13. Deaton, Angus, 1985. "Panel data from time series of cross-sections," Journal of Econometrics, Elsevier, vol. 30(1-2), pages 109-126.
    14. Devereux, Paul J., 2007. "Improved Errors-in-Variables Estimators for Grouped Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 25, pages 278-287, July.
    15. Artūras Juodis, 2018. "Pseudo Panel Data Models With Cohort Interactive Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(1), pages 47-61, January.
    16. Propper, Carol & Rees, Hedley & Green, Katherine, 2001. "The Demand for Private Medical Insurance in the UK: A Cohort Analysis," Economic Journal, Royal Economic Society, vol. 111(471), pages 180-200, May.
    17. Banks, James & Blundell, Richard & Preston, Ian, 1994. "Life-cycle expenditure allocations and the consumption costs of children," European Economic Review, Elsevier, vol. 38(7), pages 1391-1410, August.
    18. McKenzie, D.J.David J., 2004. "Asymptotic theory for heterogeneous dynamic pseudo-panels," Journal of Econometrics, Elsevier, vol. 120(2), pages 235-262, June.
    19. Verbeek, Marno & Nijman, Theo, 1993. "Minimum MSE estimation of a regression model with fixed effects from a series of cross-sections," Journal of Econometrics, Elsevier, vol. 59(1-2), pages 125-136, September.
    20. Angrist, Joshua D., 1991. "Grouped-data estimation and testing in simple labor-supply models," Journal of Econometrics, Elsevier, vol. 47(2-3), pages 243-266, February.
    21. Dolores Collado, M., 1997. "Estimating dynamic models from time series of independent cross-sections," Journal of Econometrics, Elsevier, vol. 82(1), pages 37-62.
    22. Rumman Khan, 2018. "Assessing cohort aggregation to minimise bias in pseudo-panels," Discussion Papers 2018-01, University of Nottingham, CREDIT.
    23. Arestoff, Florence & Djemai, Elodie, 2016. "Women’s Empowerment Across the Life Cycle and Generations: Evidence from Sub-Saharan Africa," World Development, Elsevier, vol. 87(C), pages 70-87.
    24. Verbeek, M.J.C.M. & Nijman, T.E., 1992. "Can cohort data be treated as genuine panel data?," Other publications TiSEM d4eada8f-b91c-4fe7-a58c-7, Tilburg University, School of Economics and Management.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Stemmler, Henry & Meemken, Eva-Marie, 2023. "Greenhouse farming and employment: Evidence from Ecuador," Food Policy, Elsevier, vol. 117(C).
    2. Apostolos Fasianos & Reamonn Lydon, 2022. "Do households with debt cut back their consumption more? New evidence from the United Kingdom," Bulletin of Economic Research, Wiley Blackwell, vol. 74(3), pages 737-760, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rumman Khan, 2018. "Assessing cohort aggregation to minimise bias in pseudo-panels," Discussion Papers 2018-01, University of Nottingham, CREDIT.
    2. Paul J. Devereux, 2007. "Small-sample bias in synthetic cohort models of labor supply," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(4), pages 839-848.
    3. Hai‐Anh H. Dang & Peter F. Lanjouw, 2023. "Measuring Poverty Dynamics with Synthetic Panels Based on Repeated Cross Sections," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 85(3), pages 599-622, June.
    4. Verbeek, Marno & Vella, Francis, 2005. "Estimating dynamic models from repeated cross-sections," Journal of Econometrics, Elsevier, vol. 127(1), pages 83-102, July.
    5. Xavier d'Haultfoeuille & Stefan Hoderlein & Yuya Sasaki, 2013. "Nonlinear difference-in-differences in repeated cross sections with continuous treatments," CeMMAP working papers CWP40/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    6. Ortiz, Rodrigo & Fernandez, Viviana, 2022. "Business perception of obstacles to innovate: Evidence from Chile with pseudo-panel data analysis," Research in International Business and Finance, Elsevier, vol. 59(C).
    7. Inoue, Atsushi, 2008. "Efficient estimation and inference in linear pseudo-panel data models," Journal of Econometrics, Elsevier, vol. 142(1), pages 449-466, January.
    8. Rosati, Nicoletta, 2013. "Efficiency of repeated-cross-section estimators in fixed-effects models," Statistics & Probability Letters, Elsevier, vol. 83(7), pages 1770-1775.
    9. Dang,Hai-Anh H. & Lanjouw,Peter F., 2013. "Measuring poverty dynamics with synthetic panels based on cross-sections," Policy Research Working Paper Series 6504, The World Bank.
    10. Daniel Bukstein & Nestor Gandelman, 2014. "Intra-Generational Social Mobility and Entrepreneurship in Uruguay," Latin American Journal of Economics-formerly Cuadernos de Economía, Instituto de Economía. Pontificia Universidad Católica de Chile., vol. 51(2), pages 227-245, November.
    11. Jose Cuesta & Hugo Ñopo & Georgina Pizzolitto, 2011. "Using Pseudo‐Panels To Measure Income Mobility In Latin America," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 57(2), pages 224-246, June.
    12. Tamvada, Jagannadha Pawan, 2010. "The Dynamics of Self-employment in a Developing Country: Evidence from India," MPRA Paper 20042, University Library of Munich, Germany.
    13. Böhringer, Christoph & García-Muros, Xaquín & González-Eguino, Mikel, 2022. "Who bears the burden of greening electricity?," Energy Economics, Elsevier, vol. 105(C).
    14. World Bank, 2016. "Tunisia Poverty Assessment 2015," World Bank Publications - Reports 24410, The World Bank Group.
    15. Artūras Juodis, 2018. "Pseudo Panel Data Models With Cohort Interactive Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(1), pages 47-61, January.
    16. Kanang Amos Akims & Perez Ayieko Onono & Dianah Mukwate Ngui, . "Trade Liberalization and Productivity in the Nigerian Manufacturing Sector," Journal of Economic and Sustainable Growth 3, Office Of The Chief Economist, Development Bank of Nigeria.
    17. McKenzie, D.J.David J., 2004. "Asymptotic theory for heterogeneous dynamic pseudo-panels," Journal of Econometrics, Elsevier, vol. 120(2), pages 235-262, June.
    18. David Aristei & Luca Pieroni, 2010. "Habits, Complementarities and Heterogeneity in Alcohol and Tobacco Demand: A Multivariate Dynamic Model," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 72(4), pages 428-457, August.
    19. Hugo Ñopo & Giorgina Pizzolitto & José Cuesta, 2007. "Usando pseudopaneles para medir la movilidad del ingreso en América," Research Department Publications 4558, Inter-American Development Bank, Research Department.
    20. Badi Baltagi & Seuck Song, 2006. "Unbalanced panel data: A survey," Statistical Papers, Springer, vol. 47(4), pages 493-523, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:obuest:v:83:y:2021:i:3:p:742-769. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/sfeixuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.