IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/10118.html
   My bibliography  Save this paper

The Central Role of Noise in Evaluating Interventions that Use Test Scores to Rank Schools

Author

Listed:
  • Kenneth Y. Chay
  • Patrick J. McEwan
  • Miguel Urquiola

Abstract

Several countries have implemented programs that use test scores to rank schools, and to reward or penalize them based on their students' average performance. Recently, Kane and Staiger (2002) have warned that imprecision in the measurement of school-level test scores could impede these efforts. There is little evidence, however, on how seriously noise hinders the evaluation of the impact of these interventions. We examine these issues in the context of Chile's P-900 program a country-wide intervention in which resources were allocated based on cutoffs in schools' mean test scores. We show that transitory noise in average scores and mean reversion lead conventional estimation approaches to greatly overstate the impacts of such programs. We then show how a regression discontinuity design that utilizes the discrete nature of the selection rule can be used to control for reversion biases. While the RD analysis provides convincing evidence that the P-900 program had significant effects on test score gains, these effects are much smaller than is widely believed.

Suggested Citation

  • Kenneth Y. Chay & Patrick J. McEwan & Miguel Urquiola, 2003. "The Central Role of Noise in Evaluating Interventions that Use Test Scores to Rank Schools," NBER Working Papers 10118, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:10118
    Note: ED LS
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w10118.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Thomas J. Kane & Douglas O. Staiger, 2002. "The Promise and Pitfalls of Using Imprecise School Accountability Measures," Journal of Economic Perspectives, American Economic Association, vol. 16(4), pages 91-114, Fall.
    2. Ashenfelter, Orley C, 1978. "Estimating the Effect of Training Programs on Earnings," The Review of Economics and Statistics, MIT Press, vol. 60(1), pages 47-57, February.
    3. Eric A. Hanushek & Margaret E. Raymond, 2002. "Improving educational quality: how best to evaluate our schools," Conference Series ; [Proceedings], Federal Reserve Bank of Boston, vol. 47(Jun), pages 193-247.
    4. Andrea Tokman, 2002. "Evaluation of the P900 Program: A Targeted Education Program for Underperforming Schools," Working Papers Central Bank of Chile 170, Central Bank of Chile.
    5. Brian A. Jacob & Lars Lefgren, 2004. "Remedial Education and Student Achievement: A Regression-Discontinuity Analysis," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 226-244, February.
    6. Kenneth Y. Chay & Patrick J. McEwan & Miguel Urquiola, 2005. "The Central Role of Noise in Evaluating Interventions That Use Test Scores to Rank Schools," American Economic Review, American Economic Association, vol. 95(4), pages 1237-1258, September.
    7. Brian A. Jacob & Lars Lefgren, 2004. "The Impact of Teacher Training on Student Achievement: Quasi-Experimental Evidence from School Reform Efforts in Chicago," Journal of Human Resources, University of Wisconsin Press, vol. 39(1).
    8. Urquiola, Miguel, 2001. "Identifying class size effects in developing countries : evidence from rural schools in Bolivia," Policy Research Working Paper Series 2711, The World Bank.
    9. Yolanda Kodrzycki, 2002. "Education in the 21st century: meeting the challenges of a changing world," New England Economic Review, Federal Reserve Bank of Boston, issue Q 4, pages 3-18.
    10. John H. Tyler & Richard J. Murnane & John B. Willett, 2000. "Estimating the Labor Market Signaling Value of the GED," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 115(2), pages 431-468.
    11. Paul Glewwe & Nauman Ilias & Michael Kremer, 2010. "Teacher Incentives," American Economic Journal: Applied Economics, American Economic Association, vol. 2(3), pages 205-227, July.
    12. Ashenfelter, Orley & Card, David, 1985. "Using the Longitudinal Structure of Earnings to Estimate the Effect of Training Programs," The Review of Economics and Statistics, MIT Press, vol. 67(4), pages 648-660, November.
    13. Jonathan Guryan, 2001. "Does Money Matter? Regression-Discontinuity Estimates from Education Finance Reform in Massachusetts," NBER Working Papers 8269, National Bureau of Economic Research, Inc.
    14. Joshua D. Angrist & Victor Lavy, 1999. "Using Maimonides' Rule to Estimate the Effect of Class Size on Scholastic Achievement," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 114(2), pages 533-575.
    15. Angrist, Joshua D. & Krueger, Alan B., 1999. "Empirical strategies in labor economics," Handbook of Labor Economics, in: O. Ashenfelter & D. Card (ed.), Handbook of Labor Economics, edition 1, volume 3, chapter 23, pages 1277-1366, Elsevier.
    16. Thomas J. Kane & Douglas O. Staiger, 2001. "Improving School Accountability Measures," NBER Working Papers 8156, National Bureau of Economic Research, Inc.
    17. Carlson, Beverley A., 2000. "Achieving educational quality: what schools teach us: learning from Chile's P900 primary schools," Desarrollo Productivo 4668, Naciones Unidas Comisión Económica para América Latina y el Caribe (CEPAL).
    18. Chang-Tai Hsieh & Miguel Urquiola, 2002. "When Schools Compete, How Do They Compete? An Assessment of Chile's Nationwide School Voucher Program," Working Papers 123, Princeton University, Department of Economics, Center for Economic Policy Studies..
    19. Victor Lavy, 2002. "Evaluating the Effect of Teachers' Group Performance Incentives on Pupil Achievement," Journal of Political Economy, University of Chicago Press, vol. 110(6), pages 1286-1317, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Emiliana Vegas & Ilana Umansky, 2005. "Improving Teaching and Learning through Effective Incentives : What Can We Learn from Education Reforms in Latin America?," World Bank Publications - Reports 8694, The World Bank Group.
    2. Frandsen, Brigham R. & Frölich, Markus & Melly, Blaise, 2012. "Quantile treatment effects in the regression discontinuity design," Journal of Econometrics, Elsevier, vol. 168(2), pages 382-395.
    3. Vegas, E & Ganimian, A. J., 2013. "Theory and Evidence on Teacher Policies in Developed and Developing Countries," Working Paper 104291, Harvard University OpenScholar.
    4. David S. Lee & Thomas Lemieux, 2010. "Regression Discontinuity Designs in Economics," Journal of Economic Literature, American Economic Association, vol. 48(2), pages 281-355, June.
    5. Mauricio Villamizar‐Villegas & Freddy A. Pinzon‐Puerto & Maria Alejandra Ruiz‐Sanchez, 2022. "A comprehensive history of regression discontinuity designs: An empirical survey of the last 60 years," Journal of Economic Surveys, Wiley Blackwell, vol. 36(4), pages 1130-1178, September.
    6. Chakrabarti Rajashri, 2013. "Impact of Voucher Design on Public School Performance: Evidence from Florida and Milwaukee Voucher Programs," The B.E. Journal of Economic Analysis & Policy, De Gruyter, vol. 13(1), pages 349-394, July.
    7. Mizala, Alejandra & Romaguera, Pilar & Urquiola, Miguel, 2007. "Socioeconomic status or noise? Tradeoffs in the generation of school quality information," Journal of Development Economics, Elsevier, vol. 84(1), pages 61-75, September.
    8. Dinand Webbink, 2005. "Causal Effects in Education," Journal of Economic Surveys, Wiley Blackwell, vol. 19(4), pages 535-560, September.
    9. Greenstone, Michael & Gayer, Ted, 2009. "Quasi-experimental and experimental approaches to environmental economics," Journal of Environmental Economics and Management, Elsevier, vol. 57(1), pages 21-44, January.
    10. Joshua D. Angrist & Jörn-Steffen Pischke, 2010. "The Credibility Revolution in Empirical Economics: How Better Research Design Is Taking the Con out of Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 24(2), pages 3-30, Spring.
    11. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    12. Alejandra Mizala & Pilar Romaguera, 2003. "Rendimiento Escolar y Premios por Desempeño. La Experiencia Latinoamericana (Scholastic performance and performance awards)," Documentos de Trabajo 157, Centro de Economía Aplicada, Universidad de Chile.
    13. Facundo Albornoz & María Victoria Anauati & Melina Furman & Mariana Luzuriaga & María Eugenia Podestá & Inés Taylor, 2020. "Training to Teach Science: Experimental Evidence from Argentina," The World Bank Economic Review, World Bank, vol. 34(2), pages 393-417.
    14. Patrick J. McEwan, 2012. "Cost-effectiveness analysis of education and health interventions in developing countries," Journal of Development Effectiveness, Taylor & Francis Journals, vol. 4(2), pages 189-213, June.
    15. Roel Elk & Marc Steeg & Dinand Webbink, 2013. "Can Financial Incentives for Regional Education Authorities Reduce School Dropout?," De Economist, Springer, vol. 161(4), pages 367-398, December.
    16. Wo[ss]mann, Ludger & West, Martin, 2006. "Class-size effects in school systems around the world: Evidence from between-grade variation in TIMSS," European Economic Review, Elsevier, vol. 50(3), pages 695-736, April.
    17. Tahir Andrabi & Jishnu Das & Asim Ijaz Khwaja & Tristan Zajonc, 2011. "Do Value-Added Estimates Add Value? Accounting for Learning Dynamics," American Economic Journal: Applied Economics, American Economic Association, vol. 3(3), pages 29-54, July.
    18. Emanuela di Gropello, 2006. "Meeting the Challenges of Secondary Education in Latin America and East Asia : Improving Efficiency and Resource Mobilization," World Bank Publications - Books, The World Bank Group, number 7173, December.
    19. David S. Lee & Thomas Lemieux, 2009. "Regression Discontinuity Designs In Economics," Working Papers 1118, Princeton University, Department of Economics, Industrial Relations Section..
    20. Victor Lavy & Analia Schlosser, 2005. "Targeted Remedial Education for Underperforming Teenagers: Costs and Benefits," Journal of Labor Economics, University of Chicago Press, vol. 23(4), pages 839-874, October.

    More about this item

    JEL classification:

    • I2 - Health, Education, and Welfare - - Education

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:10118. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.