
Measuring Test Measurement Error

Authors

  • Donald Boyd
  • Hamilton Lankford
  • Susanna Loeb
  • James Wyckoff

Abstract

Test-based accountability as well as value-added assessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for educational policy and practice. While test vendors provide estimates of split-test reliability, these measures do not account for potentially important day-to-day differences in student performance. In this article, we demonstrate a credible, low-cost approach for estimating the overall extent of measurement error that can be applied when students take three or more tests in the subject of interest (e.g., state assessments in consecutive grades). Our method generalizes the test–retest framework by allowing for (a) growth or decay in knowledge and skills between tests, (b) tests being neither parallel nor vertically scaled, and (c) the degree of measurement error varying across tests. The approach maintains relatively unrestrictive, testable assumptions regarding the structure of student achievement growth. Estimation only requires descriptive statistics (e.g., test-score correlations). With student-level data, the extent and pattern of measurement-error heteroscedasticity also can be estimated. In turn, one can compute Bayesian posterior means of achievement and achievement gains given observed scores; these estimators have statistical properties superior to those of the observed score (score gain). We employ math and English language arts test-score data from New York City to demonstrate these methods and estimate that the overall extent of test measurement error is at least twice as large as that reported by the test vendor.
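
The abstract notes that estimation needs only test-score correlations from three or more tests, and that posterior means of achievement can then be computed. The sketch below illustrates that idea in a deliberately simplified setting; it is not the authors' estimator. It assumes (these are assumptions of this example, not statements from the article) that true achievement follows a first-order Markov process across the three tests, that measurement errors are uncorrelated with achievement and with each other, and that achievement and errors are normal and homoscedastic. Under those assumptions the reliability of the middle test equals r12 * r23 / r13, and the posterior mean shrinks the observed score toward the population mean by that reliability. All function names are hypothetical.

```python
import math
import random


def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def middle_test_reliability(r12, r13, r23):
    """Reliability of test 2 from the three pairwise score correlations.

    Assumes a first-order Markov structure for true achievement, so the
    true-score correlation between tests 1 and 3 is the product of the
    adjacent true-score correlations.
    """
    return r12 * r23 / r13


def posterior_mean(score, population_mean, reliability):
    """Posterior mean of true achievement given one observed score.

    Assumes a normal prior and homoscedastic normal measurement error.
    """
    return population_mean + reliability * (score - population_mean)


def simulate_scores(n, seed, rho=0.9, error_var=0.25):
    """Simulate three observed scores per student under the assumptions above.

    True scores have unit variance, so reliability is 1 / (1 + error_var).
    """
    rng = random.Random(seed)
    innovation_sd = math.sqrt(1 - rho ** 2)
    error_sd = math.sqrt(error_var)
    s1, s2, s3 = [], [], []
    for _ in range(n):
        t1 = rng.gauss(0, 1)
        t2 = rho * t1 + innovation_sd * rng.gauss(0, 1)
        t3 = rho * t2 + innovation_sd * rng.gauss(0, 1)
        s1.append(t1 + error_sd * rng.gauss(0, 1))
        s2.append(t2 + error_sd * rng.gauss(0, 1))
        s3.append(t3 + error_sd * rng.gauss(0, 1))
    return s1, s2, s3


if __name__ == "__main__":
    s1, s2, s3 = simulate_scores(n=100_000, seed=0)
    reliability = middle_test_reliability(
        pearson(s1, s2), pearson(s1, s3), pearson(s2, s3)
    )
    print(f"estimated reliability of test 2: {reliability:.3f}")
    print(f"posterior mean for score 1.5: {posterior_mean(1.5, 0.0, reliability):.3f}")
```

In the simulation the true reliability is 1 / 1.25 = 0.8, so the estimate should land close to that value. The article's approach relaxes the restrictions used here, for example by letting measurement error vary across tests and students.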

Suggested Citation

  • Donald Boyd & Hamilton Lankford & Susanna Loeb & James Wyckoff, 2013. "Measuring Test Measurement Error," Journal of Educational and Behavioral Statistics, vol. 38(6), pages 629-663, December.
  • Handle: RePEc:sae:jedbes:v:38:y:2013:i:6:p:629-663
    DOI: 10.3102/1076998613508584

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.3102/1076998613508584
    Download Restriction: no


    References listed on IDEAS

    1. Altonji, Joseph G & Segal, Lewis M, 1996. "Small-Sample Bias in GMM Estimation of Covariance Structures," Journal of Business & Economic Statistics, American Statistical Association, vol. 14(3), pages 353-366, July.
    2. K. Jöreskog, 1971. "Statistical analysis of sets of congeneric tests," Psychometrika, Springer;The Psychometric Society, vol. 36(2), pages 109-133, June.
    3. Dale Ballou, 2009. "Test Scaling and Value-Added Measurement," Education Finance and Policy, MIT Press, vol. 4(4), pages 351-383, October.
    4. Karl Jöreskog, 1978. "Structural analysis of covariance and correlation matrices," Psychometrika, Springer;The Psychometric Society, vol. 43(4), pages 443-477, December.
    5. Wei Shen & Thomas A. Louis, 1998. "Triple‐goal estimates in two‐stage hierarchical models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 455-471.
    6. Abowd, John M & Card, David, 1989. "On the Covariance Structure of Earnings and Hours Changes," Econometrica, Econometric Society, vol. 57(2), pages 411-445, March.
    7. Petra E. Todd & Kenneth I. Wolpin, 2003. "On The Specification and Estimation of The Production Function for Cognitive Achievement," Economic Journal, Royal Economic Society, vol. 113(485), pages 3-33, February.

    Citations

    Cited by:

    1. Nirav Mehta, 2019. "Measuring quality for use in incentive schemes: The case of “shrinkage” estimators," Quantitative Economics, Econometric Society, vol. 10(4), pages 1537-1577, November.
    2. Eric Parsons & Cory Koedel & Li Tan, 2019. "Accounting for Student Disadvantage in Value-Added Models," Journal of Educational and Behavioral Statistics, vol. 44(2), pages 144-179, April.
    3. Koedel, Cory & Mihaly, Kata & Rockoff, Jonah E., 2015. "Value-added modeling: A review," Economics of Education Review, Elsevier, vol. 47(C), pages 180-195.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Donald Boyd & Hamilton Lankford & Susanna Loeb & James Wyckoff, 2012. "Measuring Test Measurement Error: A General Approach," NBER Working Papers 18010, National Bureau of Economic Research, Inc.
    2. Koedel Cory & Leatherman Rebecca & Parsons Eric, 2012. "Test Measurement Error and Inference from Value-Added Models," The B.E. Journal of Economic Analysis & Policy, De Gruyter, vol. 12(1), pages 1-37, November.
    3. Joachim Inkmann, 2000. "Finite Sample Properties of One-Step, Two-Step and Bootstrap Empirical Likelihood Approaches to Efficient GMM Estimation," Econometric Society World Congress 2000 Contributed Papers 0332, Econometric Society.
    4. Masakatsu Okubo, 2015. "Earnings Dynamics and Profile Heterogeneity: Estimates from Japanese Panel Data," The Japanese Economic Review, Japanese Economic Association, vol. 66(1), pages 112-146, March.
    5. Combes, Pierre-Philippe & Magnac, Thierry & Robin, Jean-Marc, 2004. "The dynamics of local employment in France," Journal of Urban Economics, Elsevier, vol. 56(2), pages 217-243, September.
    6. Dmytro Hryshko, 2012. "Labor income profiles are not heterogeneous: Evidence from income growth rates," Quantitative Economics, Econometric Society, vol. 3(2), pages 177-209, July.
    7. Jeremy Lise & Costas Meghir & Jean-Marc Robin, 2016. "Matching, Sorting and Wages," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 19, pages 63-87, January.
    8. Otto Kässi, 2014. "Earnings dynamics of men and women in Finland: permanent inequality versus earnings instability," Empirical Economics, Springer, vol. 46(2), pages 451-477, March.
    9. Carlos Madeira, 2015. "Identification of Earning Dynamics using Rotating Samples over Short Periods: The Case of Chile," Working Papers Central Bank of Chile 754, Central Bank of Chile.
    10. Halliday Timothy, 2011. "Health Inequality over the Life-Cycle," The B.E. Journal of Economic Analysis & Policy, De Gruyter, vol. 11(3), pages 1-21, October.
    11. Nicolas Roys, 2016. "Persistence of Shocks and the Reallocation of Labor," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 22, pages 109-130, October.
    12. Ylenia Brilli, 2022. "Mother’s Time Allocation, Childcare, and Child Cognitive Development," Journal of Human Capital, University of Chicago Press, vol. 16(2), pages 233-272.
    13. Kai Liu, 2010. "Wage Risk, On-the-job Search and Partial Insurance," 2010 Meeting Papers 1136, Society for Economic Dynamics.
    14. Fatih Guvenen, 2009. "An Empirical Investigation of Labor Income Processes," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 12(1), pages 58-79, January.
    15. Jesse Rothstein, 2007. "Do Value-Added Models Add Value? Tracking, Fixed Effects, and Causal Inference," Working Papers 1036, Princeton University, Department of Economics, Center for Economic Policy Studies.
    16. Blundell, Richard & Preston, Ian & Pistaferri, Luigi, 2002. "Partial Insurance, Information, and Consumption Dynamics," CEPR Discussion Papers 3666, C.E.P.R. Discussion Papers.
    17. Sarstedt, Marko & Ringle, Christian M. & Smith, Donna & Reams, Russell & Hair, Joseph F., 2014. "Partial least squares structural equation modeling (PLS-SEM): A useful tool for family business researchers," Journal of Family Business Strategy, Elsevier, vol. 5(1), pages 105-115.
    18. Laszlo, Sonia, 2008. "Education, Labor Supply, and Market Development in Rural Peru," World Development, Elsevier, vol. 36(11), pages 2421-2439, November.
    19. Fouarge, Didier & Muffels, Ruud, 2000. "Persistent poverty in the Netherlands, Germany and the UK," MPRA Paper 13297, University Library of Munich, Germany.
    20. Jesse Rothstein, 2010. "Teacher Quality in Educational Production: Tracking, Decay, and Student Achievement," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 125(1), pages 175-214.
