IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v88y2023i4d10.1007_s11336-022-09883-5.html
   My bibliography  Save this article

Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT

Author

Listed:
  • Felix Zimmer

    (University of Zurich)

  • Clemens Draxler

    (The Health and Life Sciences University)

  • Rudolf Debelak

    (University of Zurich)

Abstract

The Wald, likelihood ratio, score, and the recently proposed gradient statistics can be used to assess a broad range of hypotheses in item response theory models, for instance, to check the overall model fit or to detect differential item functioning. We introduce new methods for power analysis and sample size planning that can be applied when marginal maximum likelihood estimation is used. This allows the application to a variety of IRT models, which are commonly used in practice, e.g., in large-scale educational assessments. An analytical method utilizes the asymptotic distributions of the statistics under alternative hypotheses. We also provide a sampling-based approach for applications where the analytical approach is computationally infeasible. This can be the case with 20 or more items, since the computational load increases exponentially with the number of items. We performed extensive simulation studies in three practically relevant settings, i.e., testing a Rasch model against a 2PL model, testing for differential item functioning, and testing a partial credit model against a generalized partial credit model. The observed distributions of the test statistics and the power of the tests agreed well with the predictions by the proposed methods in sufficiently large samples. We provide an openly accessible R package that implements the methods for user-supplied hypotheses.

Suggested Citation

  • Felix Zimmer & Clemens Draxler & Rudolf Debelak, 2023. "Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT," Psychometrika, Springer;The Psychometric Society, vol. 88(4), pages 1249-1298, December.
  • Handle: RePEc:spr:psycho:v:88:y:2023:i:4:d:10.1007_s11336-022-09883-5
    DOI: 10.1007/s11336-022-09883-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11336-022-09883-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11336-022-09883-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Edgar Merkle & Jinyan Fan & Achim Zeileis, 2014. "Testing for Measurement Invariance with Respect to an Ordinal Variable," Psychometrika, Springer;The Psychometric Society, vol. 79(4), pages 569-584, October.
    2. D. Oakes, 1999. "Direct calculation of the information matrix via the EM," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(2), pages 479-482, April.
    3. Theo Eggen, 2000. "On the loss of information in conditional maximum likelihood estimation of item parameters," Psychometrika, Springer;The Psychometric Society, vol. 65(3), pages 337-362, September.
    4. Geoff Masters, 1982. "A rasch model for partial credit scoring," Psychometrika, Springer;The Psychometric Society, vol. 47(2), pages 149-174, June.
    5. R. Bock & Murray Aitkin, 1981. "Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm," Psychometrika, Springer;The Psychometric Society, vol. 46(4), pages 443-459, December.
    6. Ke-Hai Yuan & Ying Cheng & Jeff Patton, 2014. "Information Matrices and Standard Errors for MLEs of Item Parameters in IRT," Psychometrika, Springer;The Psychometric Society, vol. 79(2), pages 232-254, April.
    7. Shelby J. Haberman & Sandip Sinharay, 2013. "Generalized Residuals for General Models for Contingency Tables With Application to Item Response Theory," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1435-1444, December.
    8. Clemens Draxler & Rainer Alexandrowicz, 2015. "Sample Size Determination Within the Scope of Conditional Maximum Likelihood Estimation with Special Focus on Testing the Rasch Model," Psychometrika, Springer;The Psychometric Society, vol. 80(4), pages 897-919, December.
    9. Katsikatsou, Myrsini & Moustaki, Irini & Yang-Wallentin, Fan & Jöreskog, Karl G., 2012. "Pairwise likelihood estimation for factor analysis models with ordinal data," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4243-4258.
    10. Alberto Maydeu-Olivares & Rosa Montaño, 2013. "How Should We Assess the Fit of Rasch-Type Models? Approximating the Power of Goodness-of-Fit Statistics in Categorical Data Analysis," Psychometrika, Springer;The Psychometric Society, vol. 78(1), pages 116-133, January.
    11. Cees Glas, 1999. "Modification indices for the 2-PL and the nominal response model," Psychometrika, Springer;The Psychometric Society, vol. 64(3), pages 273-294, September.
    12. C. Glas & Anna Dagohoy, 2007. "A Person Fit Test For Irt Models For Polytomous Items," Psychometrika, Springer;The Psychometric Society, vol. 72(2), pages 159-180, June.
    13. Chalmers, R. Philip, 2012. "mirt: A Multidimensional Item Response Theory Package for the R Environment," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i06).
    14. Edgar Merkle & Achim Zeileis, 2013. "Tests of Measurement Invariance Without Subgroups: A Generalization of Classical Methods," Psychometrika, Springer;The Psychometric Society, vol. 78(1), pages 59-82, January.
    15. Clemens Draxler, 2010. "Sample Size Determination for Rasch Model Tests," Psychometrika, Springer;The Psychometric Society, vol. 75(4), pages 708-724, December.
    16. Artur Lemonte & Silvia Ferrari, 2012. "The local power of the gradient test," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 64(2), pages 373-381, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ting Wang & Carolin Strobl & Achim Zeileis & Edgar C. Merkle, 2018. "Score-Based Tests of Differential Item Functioning via Pairwise Maximum Likelihood Estimation," Psychometrika, Springer;The Psychometric Society, vol. 83(1), pages 132-155, March.
    2. Alexander Robitzsch, 2021. "A Comprehensive Simulation Study of Estimation Methods for the Rasch Model," Stats, MDPI, vol. 4(4), pages 1-23, October.
    3. Ting Wang & Benjamin Graves & Yves Rosseel & Edgar C. Merkle, 2022. "Computation and application of generalized linear mixed model derivatives using lme4," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 1173-1193, September.
    4. Björn Andersson & Tao Xin, 2021. "Estimation of Latent Regression Item Response Theory Models Using a Second-Order Laplace Approximation," Journal of Educational and Behavioral Statistics, , vol. 46(2), pages 244-265, April.
    5. C. Glas & Anna Dagohoy, 2007. "A Person Fit Test For Irt Models For Polytomous Items," Psychometrika, Springer;The Psychometric Society, vol. 72(2), pages 159-180, June.
    6. Dylan Molenaar, 2015. "Heteroscedastic Latent Trait Models for Dichotomous Data," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 625-644, September.
    7. Sara Fernandes & Guillaume Fond & Xavier Zendjidjian & Pierre Michel & Karine Baumstarck & Christophe Lançon & Ludovic Samalin & Pierre-Michel Llorca & Magali Coldefy & Pascal Auquier & Laurent Boyer , 2022. "Development and Calibration of the PREMIUM Item Bank for Measuring Respect and Dignity for Patients with Severe Mental Illness," Post-Print hal-03649277, HAL.
    8. Yang Liu & Ji Seung Yang & Alberto Maydeu-Olivares, 2019. "Restricted Recalibration of Item Response Theory Models," Psychometrika, Springer;The Psychometric Society, vol. 84(2), pages 529-553, June.
    9. Albert Maydeu-Olivares & Harry Joe, 2006. "Limited Information Goodness-of-fit Testing in Multidimensional Contingency Tables," Psychometrika, Springer;The Psychometric Society, vol. 71(4), pages 713-732, December.
    10. Yang Liu & Jan Hannig, 2017. "Generalized Fiducial Inference for Logistic Graded Response Models," Psychometrika, Springer;The Psychometric Society, vol. 82(4), pages 1097-1125, December.
    11. Scott Monroe, 2019. "Estimation of Expected Fisher Information for IRT Models," Journal of Educational and Behavioral Statistics, , vol. 44(4), pages 431-447, August.
    12. Clemens Draxler & Rainer Alexandrowicz, 2015. "Sample Size Determination Within the Scope of Conditional Maximum Likelihood Estimation with Special Focus on Testing the Rasch Model," Psychometrika, Springer;The Psychometric Society, vol. 80(4), pages 897-919, December.
    13. R. Philip Chalmers, 2018. "Model-Based Measures for Detecting and Quantifying Response Bias," Psychometrika, Springer;The Psychometric Society, vol. 83(3), pages 696-732, September.
    14. Melissa Gladstone & Gillian Lancaster & Gareth McCray & Vanessa Cavallera & Claudia R. L. Alves & Limbika Maliwichi & Muneera A. Rasheed & Tarun Dua & Magdalena Janus & Patricia Kariger, 2021. "Validation of the Infant and Young Child Development (IYCD) Indicators in Three Countries: Brazil, Malawi and Pakistan," IJERPH, MDPI, vol. 18(11), pages 1-19, June.
    15. Payton J. Jones & Patrick Mair & Thorsten Simon & Achim Zeileis, 2020. "Network Trees: A Method for Recursively Partitioning Covariance Structures," Psychometrika, Springer;The Psychometric Society, vol. 85(4), pages 926-945, December.
    16. Clemens Draxler, 2018. "Bayesian conditional inference for Rasch models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(2), pages 245-262, April.
    17. Gerhard Tutz & Gunther Schauberger, 2015. "A Penalty Approach to Differential Item Functioning in Rasch Models," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 21-43, March.
    18. Battauz, Michela & Vidoni, Paolo, 2022. "A likelihood-based boosting algorithm for factor analysis models with binary data," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    19. Georg Gittler & Gerhard Fischer, 2011. "IRT-Based Measurement of Short-Term Changes of Ability, With an Application to Assessing the “Mozart Effectâ€," Journal of Educational and Behavioral Statistics, , vol. 36(1), pages 33-75, February.
    20. Robert Zwitser & Gunter Maris, 2015. "Conditional Statistical Inference with Multistage Testing Designs," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 65-84, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:88:y:2023:i:4:d:10.1007_s11336-022-09883-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.