Printed from https://ideas.repec.org/p/nbr/nberwo/18010.html

Measuring Test Measurement Error: A General Approach

Authors
  • Donald Boyd
  • Hamilton Lankford
  • Susanna Loeb
  • James Wyckoff

Abstract

Test-based accountability, including value-added assessment, and experimental and quasi-experimental research in education all rely on achievement tests to measure student skills and knowledge. Yet we know little about important properties of these tests, such as the extent of test measurement error and its implications for educational policy and practice. While test vendors provide estimates of split-test reliability, these measures do not account for potentially important day-to-day differences in student performance. We show that there is a credible, low-cost approach for estimating the total test measurement error that can be applied when one or more cohorts of students take three or more tests in the subject of interest (e.g., state assessments in three consecutive grades). Our method generalizes the test-retest framework, allowing for either growth or decay in knowledge and skills between tests as well as variation in the degree of measurement error across tests. The approach maintains relatively unrestrictive, testable assumptions regarding the structure of student achievement growth, and estimation requires only descriptive statistics (e.g., correlations) for the tests. When student-level test-score data are available, the extent and pattern of measurement-error heteroskedasticity can also be estimated. Using math and ELA test data from New York City, we estimate that the overall extent of test measurement error is more than twice as large as that reported by the test vendor, and we demonstrate how combining estimates of the total measurement error and the degree of heteroskedasticity with observed scores can yield meaningful improvements in the precision of student achievement and achievement-gain estimates.
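The core identification idea in the abstract, recovering the extent of measurement error from the correlations among three or more tests of the same construct, can be illustrated with the classical three-test identity. The sketch below assumes a simple congeneric model (one underlying trait, independent, homoskedastic errors); the paper's estimator is more general, allowing growth or decay between tests and test-specific error variances, and the function names here are illustrative, not the authors' code:

```python
import numpy as np

def triad_reliabilities(r12, r13, r23):
    """Estimate the reliability of each of three tests of the same trait
    from their pairwise correlations.

    Under a single-factor (congeneric) model with independent errors,
    reliability_1 = r12 * r13 / r23, and symmetrically for tests 2 and 3.
    """
    return (r12 * r13 / r23,   # reliability of test 1
            r12 * r23 / r13,   # reliability of test 2
            r13 * r23 / r12)   # reliability of test 3

def shrink_toward_mean(scores, reliability):
    """Kelley-style shrinkage: pull each observed score toward the cohort
    mean in proportion to (1 - reliability). This is one simple way that
    an estimate of total measurement error can improve the precision of
    individual achievement estimates."""
    scores = np.asarray(scores, dtype=float)
    m = scores.mean()
    return m + reliability * (scores - m)
```

On simulated data with known error variances, the triad formula recovers the true reliabilities from the observed correlations alone, and the shrunken scores have smaller mean squared error against the latent trait than the raw scores, mirroring the precision gains the abstract describes.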

Suggested Citation

  • Donald Boyd & Hamilton Lankford & Susanna Loeb & James Wyckoff, 2012. "Measuring Test Measurement Error: A General Approach," NBER Working Papers 18010, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:18010
    Note: ED

    Download full text from publisher

    File URL: http://www.nber.org/papers/w18010.pdf
    Download Restriction: no

    References listed on IDEAS

    1. Altonji, Joseph G & Segal, Lewis M, 1996. "Small-Sample Bias in GMM Estimation of Covariance Structures," Journal of Business & Economic Statistics, American Statistical Association, vol. 14(3), pages 353-366, July.
    2. Daniel F. McCaffrey & Tim R. Sass & J. R. Lockwood & Kata Mihaly, 2009. "The Intertemporal Variability of Teacher Effect Estimates," Education Finance and Policy, MIT Press, vol. 4(4), pages 572-606, October.
    3. Daniel Aaronson & Lisa Barrow & William Sander, 2007. "Teachers and Student Achievement in the Chicago Public High Schools," Journal of Labor Economics, University of Chicago Press, vol. 25(1), pages 95-135.
    4. Cory Koedel & Julian Betts, 2007. "Re-Examining the Role of Teacher Quality In the Educational Production Function," Working Papers 0708, Department of Economics, University of Missouri.
    5. Abowd, John M & Card, David, 1989. "On the Covariance Structure of Earnings and Hours Changes," Econometrica, Econometric Society, vol. 57(2), pages 411-445, March.
    6. Daniel G. Sullivan, 2001. "A note on the estimation of linear regression models with Heteroskedastic measurement errors," Working Paper Series WP-01-23, Federal Reserve Bank of Chicago.
    7. Dan Goldhaber & Emily Anthony, 2007. "Can Teacher Quality Be Effectively Assessed? National Board Certification as a Signal of Effective Teaching," The Review of Economics and Statistics, MIT Press, vol. 89(1), pages 134-150, February.
    8. Petra E. Todd & Kenneth I. Wolpin, 2003. "On The Specification and Estimation of The Production Function for Cognitive Achievement," Economic Journal, Royal Economic Society, vol. 113(485), pages 3-33, February.
    9. Cameron, A. Colin & Trivedi, Pravin K., 2005. "Microeconometrics," Cambridge Books, Cambridge University Press, number 9780521848053, January.

    Citations

    Citations are extracted by the CitEc Project.


    Cited by:

    1. Cory Koedel & Jiaxi Li, 2016. "The Efficiency Implications Of Using Proportional Evaluations To Shape The Teaching Workforce," Contemporary Economic Policy, Western Economic Association International, vol. 34(1), pages 47-62, January.
    2. Timothy N. Bond & Kevin Lang, 2018. "The Black–White Education Scaled Test-Score Gap in Grades K-7," Journal of Human Resources, University of Wisconsin Press, vol. 53(4), pages 891-917.
    3. Jason A. Grissom & Demetra Kalogrides & Susanna Loeb, 2012. "Using Student Test Scores to Measure Principal Performance," NBER Working Papers 18568, National Bureau of Economic Research, Inc.
    4. J. R. Lockwood & Daniel F. McCaffrey, 2014. "Correcting for Test Score Measurement Error in ANCOVA Models for Estimating Treatment Effects," Journal of Educational and Behavioral Statistics, vol. 39(1), pages 22-52, February.
    5. Eric Parsons, 2014. "Does Attending a Low-Achieving School Affect High-Performing Student Outcomes?," Working Papers 1407, Department of Economics, University of Missouri, revised 18 Feb 2015.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Buddin, Richard, 2010. "How effective are Los Angeles elementary teachers and schools?," MPRA Paper 27366, University Library of Munich, Germany.
    2. Stacy, Brian & Guarino, Cassandra & Wooldridge, Jeffrey, 2018. "Does the precision and stability of value-added estimates of teacher performance depend on the types of students they serve?," Economics of Education Review, Elsevier, vol. 64(C), pages 50-74.
    3. Jesse Rothstein, 2007. "Do Value-Added Models Add Value? Tracking, Fixed Effects, and Causal Inference," Working Papers 1036, Princeton University, Department of Economics, Center for Economic Policy Studies.
    4. Goldhaber, Dan & Cowan, James & Walch, Joe, 2013. "Is a good elementary teacher always good? Assessing teacher performance estimates across subjects," Economics of Education Review, Elsevier, vol. 36(C), pages 216-228.
    5. Jesse Rothstein, 2010. "Teacher Quality in Educational Production: Tracking, Decay, and Student Achievement," The Quarterly Journal of Economics, Oxford University Press, vol. 125(1), pages 175-214.
    6. Goldhaber, Dan & Liddle, Stephanie & Theobald, Roddy, 2013. "The gateway to the profession: Assessing teacher preparation programs based on student achievement," Economics of Education Review, Elsevier, vol. 34(C), pages 29-44.
    7. Dan Goldhaber & Michael Hansen, 2013. "Is it Just a Bad Class? Assessing the Long-term Stability of Estimated Teacher Performance," Economica, London School of Economics and Political Science, vol. 80(319), pages 589-612, July.
    8. Condie, Scott & Lefgren, Lars & Sims, David, 2014. "Teacher heterogeneity, value-added and education policy," Economics of Education Review, Elsevier, vol. 40(C), pages 76-92.
    9. Manuel Arellano & Stéphane Bonhomme, 2012. "Identifying Distributional Characteristics in Random Coefficients Panel Data Models," Review of Economic Studies, Oxford University Press, vol. 79(3), pages 987-1020.
    10. Figlio, D. & Karbownik, K. & Salvanes, K.G., 2016. "Education Research and Administrative Data," Handbook of the Economics of Education, Elsevier.
    11. Lindsay Fox, 2016. "Playing to Teachers’ Strengths: Using Multiple Measures of Teacher Effectiveness to Improve Teacher Assignments," Education Finance and Policy, MIT Press, vol. 11(1), pages 70-96, Winter.
    12. Goel, Deepti & Barooah, Bidisha, 2018. "Drivers of Student Performance: Evidence from Higher Secondary Public Schools in Delhi," GLO Discussion Paper Series 231, Global Labor Organization (GLO).
    13. Donald Boyd & Hamilton Lankford & Susanna Loeb & James Wyckoff, 2013. "Measuring Test Measurement Error," Journal of Educational and Behavioral Statistics, vol. 38(6), pages 629-663, December.
    14. Richard Buddin & Gema Zamarro, 2009. "Teacher Qualifications and Middle School Student Achievement," Working Papers WR-671-IES, RAND Corporation.
    15. Kata Mihaly & Daniel F. McCaffrey & J. R. Lockwood & Tim R. Sass, 2010. "Centering and reference groups for estimates of fixed effects: Modifications to felsdvreg," Stata Journal, StataCorp LP, vol. 10(1), pages 82-103, March.
    16. Richard Buddin & Gena Zamarro, 2008. "Teacher Quality, Teacher Licensure Tests, and Student Achievement," Working Papers 555, RAND Corporation.
    17. Harris, Douglas N. & Sass, Tim R., 2011. "Teacher training, teacher quality and student achievement," Journal of Public Economics, Elsevier, vol. 95(7), pages 798-812.
    18. Araujo P., Maria Daniela & Quis, Johanna Sophie, 2021. "Parents can tell! Evidence on classroom quality differences in German primary schools," BERG Working Paper Series 172, Bamberg University, Bamberg Economic Research Group.
    19. Cory Koedel & Julian R. Betts, 2011. "Does Student Sorting Invalidate Value-Added Models of Teacher Effectiveness? An Extended Analysis of the Rothstein Critique," Education Finance and Policy, MIT Press, vol. 6(1), pages 18-42, January.

    More about this item

    JEL classification:

    • I21 - Health, Education, and Welfare - - Education - - - Analysis of Education
