Test Scaling and Value-Added Measurement

Author

Dale Ballou
(Department of Leadership, Policy and Organizations, Peabody College, Vanderbilt University)

Abstract

Conventional value-added assessment requires that achievement be reported on an interval scale. While many metrics do not have this property, application of item response theory (IRT) is said to produce interval scales. However, it is difficult to confirm that the requisite conditions are met. Even when they are, the properties of the data that make a test IRT scalable may not be the properties we seek to represent in an achievement scale, as shown by the lack of surface plausibility of many scales resulting from the application of IRT. An alternative, ordinal data analysis, is presented. It is shown that value-added estimates are sensitive to the choice of ordinal methods over conventional techniques. Value-added practitioners should ask themselves whether they are so confident of the metric properties of these scales that they are willing to attribute differences to the superiority of the latter. © 2009 American Education Finance Association
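The sensitivity the abstract describes can be illustrated with a toy calculation. The sketch below is not the paper's method or data: the scores, the squaring transformation, and the helper names (`mean_gain`, `share_outscoring`, `rescale`) are all hypothetical, chosen only to show that a comparison of mean gains can reverse under an order-preserving rescaling, while a comparison that uses only the ordering of scores cannot.

```python
# Hypothetical scores: two teachers whose students start at identical pre-test scores.
pre = [10, 20, 30]
post_a = [18, 26, 31]   # teacher A: larger gains at the low end of the scale
post_b = [12, 22, 40]   # teacher B: larger gain at the high end of the scale


def mean_gain(pre_scores, post_scores):
    """Average post-minus-pre difference; meaningful only on an interval scale."""
    return sum(b - a for a, b in zip(pre_scores, post_scores)) / len(pre_scores)


def share_outscoring(post_first, post_second):
    """Share of matched pairs (same pre-test score) where the first teacher's
    student has the higher post-test score. Uses only the ordering of scores."""
    wins = sum(a > b for a, b in zip(post_first, post_second))
    return wins / len(post_first)


def rescale(scores):
    """An assumed monotone (order-preserving) transformation for positive scores."""
    return [s ** 2 for s in scores]


# Mean gains on the original scale and on the rescaled one.
gain_a, gain_b = mean_gain(pre, post_a), mean_gain(pre, post_b)
gain_a_sq = mean_gain(rescale(pre), rescale(post_a))
gain_b_sq = mean_gain(rescale(pre), rescale(post_b))

# Ordinal comparison on both scales.
ordinal = share_outscoring(post_a, post_b)
ordinal_sq = share_outscoring(rescale(post_a), rescale(post_b))

print("A ahead on mean gain, original scale:", gain_a > gain_b)
print("A ahead on mean gain, rescaled:", gain_a_sq > gain_b_sq)
print("Ordinal comparison unchanged by rescaling:", ordinal == ordinal_sq)
```

In this constructed example the two scales rank the teachers differently on mean gain, so which teacher appears more effective depends on whether the original metric is trusted as an interval scale. The matched-pair share is the same on both scales because squaring positive scores preserves their order.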

Suggested Citation

Dale Ballou, 2009. "Test Scaling and Value-Added Measurement," Education Finance and Policy, MIT Press, vol. 4(4), pages 351-383, October.

Handle: RePEc:tpr:edfpol:v:4:y:2009:i:4:p:351-383

Citations

Cited by:

Barrett, Nathan & Toma, Eugenia F., 2013. "Reward or punishment? Class size and teacher quality," Economics of Education Review, Elsevier, vol. 35(C), pages 41-52.
Alexander Robitzsch, 2024. "Estimation of Standard Error, Linking Error, and Total Error for Robust and Nonrobust Linking Methods in the Two-Parameter Logistic Model," Stats, MDPI, vol. 7(3), pages 1-21, June.
Koedel Cory & Leatherman Rebecca & Parsons Eric, 2012. "Test Measurement Error and Inference from Value-Added Models," The B.E. Journal of Economic Analysis & Policy, De Gruyter, vol. 12(1), pages 1-37, November.
- Cory Koedel & Rebecca Leatherman & Eric Parsons, 2012. "Test Measurement Error and Inference from Value-Added Models," Working Papers 1201, Department of Economics, University of Missouri.
Cory Koedel & Mark Ehlert & Eric Parsons & Michael Podgursky, 2012. "Selecting Growth Measures for School and Teacher Evaluations," Working Papers 1210, Department of Economics, University of Missouri.
- Cory Koedel & Mark Ehlert & Eric Parsons & Michael Podgursky & P. Brett Xiang, 2014. "Selecting Growth Measures for School and Teacher Evaluations," Working Papers 1401, Department of Economics, University of Missouri.
Seth Gershenson, 2016. "Performance Standards and Employee Effort: Evidence From Teacher Absences," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 35(3), pages 615-638, June.
- Seth Gershenson, 2015. "Performance Standards and Employee Effort: Evidence from Teacher Absences," Upjohn Working Papers 15-217, W.E. Upjohn Institute for Employment Research.
- Gershenson, Seth, 2015. "Performance Standards and Employee Effort: Evidence from Teacher Absences," IZA Discussion Papers 9203, Institute of Labor Economics (IZA).
Seth Gershenson & Diane Whitmore Schanzenbach, 2016. "Linking Teacher Quality, Student Attendance, and Student Achievement," Education Finance and Policy, MIT Press, vol. 11(2), pages 125-149, Spring.
Alexander Robitzsch, 2021. "About the Equivalence of the Latent D-Scoring Model and the Two-Parameter Logistic Item Response Model," Mathematics, MDPI, vol. 9(13), pages 1-17, June.
Benjamin R. Shear & Sean F. Reardon, 2021. "Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions," Journal of Educational and Behavioral Statistics, vol. 46(1), pages 3-33, February.
Daniel M. Bolt & Xiangyi Liao, 2022. "Item Complexity: A Neglected Psychometric Feature of Test Items?," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1195-1213, December.
Derek C. Briggs & Ben Domingue, 2013. "The Gains From Vertical Scaling," Journal of Educational and Behavioral Statistics, vol. 38(6), pages 551-576, December.
Donald Boyd & Hamilton Lankford & Susanna Loeb & James Wyckoff, 2013. "Measuring Test Measurement Error," Journal of Educational and Behavioral Statistics, vol. 38(6), pages 629-663, December.
Gadi Barlevy & Derek Neal, 2012. "Pay for Percentile," American Economic Review, American Economic Association, vol. 102(5), pages 1805-1831, August.
- Barlevy, Gadi & Neal, Derek, 2009. "Pay for Percentile," IZA Discussion Papers 4383, Institute of Labor Economics (IZA).
- Gadi Barlevy & Derek Neal, 2011. "Pay for Percentile," NBER Working Papers 17194, National Bureau of Economic Research, Inc.
- Gadi Barlevy & Derek Neal, 2009. "Pay for percentile," Working Paper Series WP-09-09, Federal Reserve Bank of Chicago.
Gershenson, Seth & Holt, Stephen B. & Papageorge, Nicholas W., 2015. "Who Believes in Me? The Effect of Student-Teacher Demographic Match on Teacher Expectations," IZA Discussion Papers 9202, Institute of Labor Economics (IZA).
- Seth Gershenson & Stephen B. Holt & Nicholas Papageorge, 2015. "Who Believes in Me? The Effect of Student-Teacher Demographic Match on Teacher Expectations," Upjohn Working Papers 15-231, W.E. Upjohn Institute for Employment Research.
Brendan Houng & Moshe Justman, 2013. "Comparing Least-Squares Value-Added Analysis and Student Growth Percentile Analysis for Evaluating Student Progress and Estimating School Effects," Melbourne Institute Working Paper Series wp2013n07, Melbourne Institute of Applied Economic and Social Research, The University of Melbourne.
David M. Quinn & Andrew D. Ho, 2021. "Ordinal Approaches to Decomposing Between-Group Test Score Disparities," Journal of Educational and Behavioral Statistics, vol. 46(4), pages 466-500, August.
Moshe Justman & Brendan Houng, 2013. "A Comparison Of Two Methods For Estimating School Effects And Tracking Student Progress From Standardized Test Scores," Working Papers 1316, Ben-Gurion University of the Negev, Department of Economics.
Wiswall, Matthew, 2013. "The dynamics of teacher quality," Journal of Public Economics, Elsevier, vol. 100(C), pages 61-78.
J. R. Lockwood & Daniel F. McCaffrey, 2014. "Correcting for Test Score Measurement Error in ANCOVA Models for Estimating Treatment Effects," Journal of Educational and Behavioral Statistics, vol. 39(1), pages 22-52, February.

More about this item

JEL classification:

I20 - Health, Education, and Welfare - - Education - - - General
I21 - Health, Education, and Welfare - - Education - - - Analysis of Education

