High-Stakes Testing Case Study: A Latent Variable Approach for Assessing Measurement and Prediction Invariance

Author

Listed:
  • Steven Andrew Culpepper

    (University of Illinois at Urbana–Champaign)

  • Herman Aguinis

    (George Washington University)

  • Justin L. Kern

    (University of Illinois at Urbana-Champaign)

  • Roger Millsap

    (Arizona State University)

Abstract

The existence of differences in prediction systems involving test scores across demographic groups continues to be a thorny and unresolved scientific, professional, and societal concern. Our case study uses a two-stage least squares (2SLS) estimator to jointly assess measurement invariance and prediction invariance in high-stakes testing. Accordingly, we examined differences across groups based on latent as opposed to observed scores, using data for 176 colleges and universities from The College Board. Measurement invariance was rejected for the SAT mathematics (SAT-M) subtest at the 0.01 level for 74.5% and 29.9% of cohorts for Black versus White and Hispanic versus White comparisons, respectively. Also, on average, Black students with the same standing on a common factor had observed SAT-M scores that were nearly a third of a standard deviation lower than those of comparable White students. We also found evidence that group differences in SAT-M measurement intercepts may partly explain the well-known finding of observed differences in prediction intercepts. Additionally, nearly a quarter of the statistically significant observed intercept differences were no longer statistically significant at the 0.05 level once predictor measurement error was accounted for using the 2SLS procedure. Our joint measurement and prediction invariance approach based on latent scores opens the door to a new high-stakes testing research agenda whose goal is not simply to assess whether observed group-based differences exist and what their size and direction are. Rather, the goal of this research agenda is to assess the causal chain starting with underlying theoretical mechanisms (e.g., contextual factors, differences in latent predictor scores) that affect the size and direction of any observed differences.
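
To illustrate why accounting for predictor measurement error can change a regression result, here is a minimal sketch of a generic 2SLS estimator on simulated data. It is not the authors' model, data, or code: the sample size, loadings, error variances, and the helper names `ols`, `two_stage_least_squares`, and `simulate` are all assumptions made for illustration. One error-prone indicator (`x1`) plays the role of the observed predictor, and a second indicator of the same latent score (`x2`) serves as the instrument.

```python
import numpy as np

def ols(X, y):
    """Least-squares coefficients for y ~ X (X already includes an intercept column)."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

def two_stage_least_squares(y, x, z):
    """2SLS for y ~ 1 + x, instrumenting the error-prone predictor x with z."""
    n = len(y)
    Z = np.column_stack([np.ones(n), z])
    # Stage 1: project the error-prone predictor onto the instrument.
    x_hat = Z @ ols(Z, x)
    # Stage 2: regress the criterion on the fitted predictor.
    X_hat = np.column_stack([np.ones(n), x_hat])
    return ols(X_hat, y)

def simulate(n=20000, slope=0.5, seed=0):
    """Hypothetical data: one latent score, two noisy indicators, one criterion."""
    rng = np.random.default_rng(seed)
    xi = rng.normal(size=n)                          # latent predictor score
    x1 = xi + rng.normal(scale=0.7, size=n)          # observed predictor (with error)
    x2 = 0.8 * xi + rng.normal(scale=0.7, size=n)    # second indicator, used as instrument
    y = 1.0 + slope * xi + rng.normal(scale=0.5, size=n)
    return y, x1, x2

y, x1, x2 = simulate()
b_ols = ols(np.column_stack([np.ones(len(y)), x1]), y)
b_2sls = two_stage_least_squares(y, x1, x2)
print("OLS slope :", round(float(b_ols[1]), 3))
print("2SLS slope:", round(float(b_2sls[1]), 3))
```

Under these assumed values the reliability of `x1` is 1 / (1 + 0.49), so the OLS slope should be attenuated toward roughly 0.34, while the 2SLS slope should land near the true latent slope of 0.5. Point estimates from this manual two-stage approach are valid, but second-stage standard errors would need the usual 2SLS correction before being used for significance tests.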

Suggested Citation

  • Steven Andrew Culpepper & Herman Aguinis & Justin L. Kern & Roger Millsap, 2019. "High-Stakes Testing Case Study: A Latent Variable Approach for Assessing Measurement and Prediction Invariance," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 285-309, March.
  • Handle: RePEc:spr:psycho:v:84:y:2019:i:1:d:10.1007_s11336-018-9649-2
    DOI: 10.1007/s11336-018-9649-2

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-018-9649-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11336-018-9649-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    References listed on IDEAS

    1. Anonymous, 2018. "Principles for the Validation and Use of Personnel Selection Procedures," Industrial and Organizational Psychology, Cambridge University Press, vol. 11(S1), pages 1-97, December.
    2. Jerry A. Hausman & Whitney K. Newey & Tiemen Woutersen & John C. Chao & Norman R. Swanson, 2012. "Instrumental variable estimation with heteroskedasticity and many instruments," Quantitative Economics, Econometric Society, vol. 3(2), pages 211-255, July.
    3. Z. Birnbaum & E. Paulson & F. Andrews, 1950. "On the effect of selection performed on some coordinates of a multi-dimensional population," Psychometrika, Springer;The Psychometric Society, vol. 15(2), pages 191-204, June.
    4. William Meredith, 1993. "Measurement invariance, factor analysis and factorial invariance," Psychometrika, Springer;The Psychometric Society, vol. 58(4), pages 525-543, December.
    5. Steven Culpepper, 2012. "Using the Criterion-Predictor Factor Model to Compute the Probability of Detecting Prediction Bias with Ordinary Least Squares Regression," Psychometrika, Springer;The Psychometric Society, vol. 77(3), pages 561-580, July.
    6. Kenneth Bollen, 1996. "An alternative two stage least squares (2SLS) estimator for latent variable equations," Psychometrika, Springer;The Psychometric Society, vol. 61(1), pages 109-121, March.
    7. Roger Millsap, 2007. "Invariance in Measurement and Prediction Revisited," Psychometrika, Springer;The Psychometric Society, vol. 72(4), pages 461-473, December.
    8. Bengt Muthén & David Kaplan & Michael Hollis, 1987. "On structural equation modeling with data that are not missing completely at random," Psychometrika, Springer;The Psychometric Society, vol. 52(3), pages 431-462, September.
    9. Kenneth Bollen & Albert Maydeu-Olivares, 2007. "A Polychoric Instrumental Variable (PIV) Estimator for Structural Equation Models with Categorical Variables," Psychometrika, Springer;The Psychometric Society, vol. 72(3), pages 309-326, September.
    10. Denny Borsboom, 2006. "The attack of the psychometricians," Psychometrika, Springer;The Psychometric Society, vol. 71(3), pages 425-440, September.
    11. Sophia Rabe-Hesketh & Anders Skrondal & Andrew Pickles, 2004. "Generalized multilevel structural equation modeling," Psychometrika, Springer;The Psychometric Society, vol. 69(2), pages 167-190, June.
    12. Kenneth Bollen & Stanislav Kolenikov & Shawn Bauldry, 2014. "Model-Implied Instrumental Variable—Generalized Method of Moments (MIIV-GMM) Estimators for Latent Variable Models," Psychometrika, Springer;The Psychometric Society, vol. 79(1), pages 20-50, January.

    Citations

    Cited by:

    1. Jeanne A. Teresi & Chun Wang & Marjorie Kleinman & Richard N. Jones & David J. Weiss, 2021. "Differential Item Functioning Analyses of the Patient-Reported Outcomes Measurement Information System (PROMIS®) Measures: Methods, Challenges, Advances, and Future Directions," Psychometrika, Springer;The Psychometric Society, vol. 86(3), pages 674-711, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zachary F. Fisher & Kenneth A. Bollen, 2020. "An Instrumental Variable Estimator for Mixed Indicators: Analytic Derivatives and Alternative Parameterizations," Psychometrika, Springer;The Psychometric Society, vol. 85(3), pages 660-683, September.
    2. Shaobo Jin & Fan Yang-Wallentin & Kenneth A. Bollen, 2021. "A unified model-implied instrumental variable approach for structural equation modeling with mixed variables," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 564-594, June.
    3. Steven Culpepper, 2012. "Using the Criterion-Predictor Factor Model to Compute the Probability of Detecting Prediction Bias with Ordinary Least Squares Regression," Psychometrika, Springer;The Psychometric Society, vol. 77(3), pages 561-580, July.
    4. Eldad Davidov & Stefan Thörner & Peter Schmidt & Stefanie Gosen & Carina Wolf, 2011. "Level and change of group-focused enmity in Germany: unconditional and conditional latent growth curve models with four panel waves," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(4), pages 481-500, December.
    5. Hayakawa, Kazuhiko, 2019. "Alternative over-identifying restriction test in the GMM estimation of panel data models," Econometrics and Statistics, Elsevier, vol. 10(C), pages 71-95.
    6. Gianmaria Bottoni, 2018. "A Multilevel Measurement Model of Social Cohesion," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 136(3), pages 835-857, April.
    7. Lisa D. Wijsen & Denny Borsboom & Tiago Cabaço & Willem J. Heiser, 2019. "An Academic Genealogy of Psychometric Society Presidents," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 562-588, June.
    8. Kano, Yutaka & Takai, Keiji, 2011. "Analysis of NMAR missing data without specifying missing-data mechanisms in a linear latent variate model," Journal of Multivariate Analysis, Elsevier, vol. 102(9), pages 1241-1255, October.
    9. Klaus Holst & Esben Budtz-Jørgensen, 2013. "Linear latent variable models: the lava-package," Computational Statistics, Springer, vol. 28(4), pages 1385-1452, August.
    10. Alexander Robitzsch, 2020. "Lp Loss Functions in Invariance Alignment and Haberman Linking with Few or Many Groups," Stats, MDPI, vol. 3(3), pages 1-38, August.
    11. Steven Culpepper, 2013. "Erratum to: Using the Criterion-Predictor Factor Model to Compute the Probability of Detecting Prediction Bias with Ordinary Least Squares Regression," Psychometrika, Springer;The Psychometric Society, vol. 78(3), pages 554-555, July.
    12. Dylan Molenaar, 2015. "Heteroscedastic Latent Trait Models for Dichotomous Data," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 625-644, September.
    13. Anna Ruelens & Bart Meuleman & Ides Nicaise, 2018. "Examining Measurement Isomorphism of Multilevel Constructs: The Case of Political Trust," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 140(3), pages 907-927, December.
    14. Könül Karimova & Benő Csapó, 2021. "Cognitive and Affective Components of Verbal Self-Concepts and Internal/External Frame of Reference Within the Multidimensional Verbal Domain," SAGE Open, vol. 11(2), pages 21582440211, May.
    15. Christian Gische & Manuel C. Voelkle, 2022. "Beyond the Mean: A Flexible Framework for Studying Causal Effects Using Linear Models," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 868-901, September.
    16. Liu, Steven Y.H. & Deligonul, Seyda & Cavusgil, S. Tamer & Chiou, Jyh-Shen, 2021. "Addressing psychic distance and learning in international buyer-seller relationships: The role of firm exploration and asset specificity," Journal of World Business, Elsevier, vol. 56(4).
    17. Johan Oud & Manuel Voelkle, 2014. "Do missing values exist? Incomplete data handling in cross-national longitudinal studies by means of continuous time modeling," Quality & Quantity: International Journal of Methodology, Springer, vol. 48(6), pages 3271-3288, November.
    18. Morricone, Serena & Munari, Federico & Oriani, Raffaele & de Rassenfosse, Gaetan, 2017. "Commercialization Strategy and IPO Underpricing," Research Policy, Elsevier, vol. 46(6), pages 1133-1141.
    19. Liat Ayalon, 2018. "Perceived Age Discrimination: A Precipitator or a Consequence of Depressive Symptoms?," The Journals of Gerontology: Series B, The Gerontological Society of America, vol. 73(5), pages 860-869.
    20. Ihsana Sabriani Borualogo & Ferran Casas, 2023. "Bullying Victimisation and Children’s Subjective Well-being: A Comparative Study in Seven Asian Countries," Child Indicators Research, Springer;The International Society of Child Indicators (ISCI), vol. 16(1), pages 1-27, February.
