Deep Reinforcement Learning for Adaptive Learning Systems

My bibliography Save this article

Deep Reinforcement Learning for Adaptive Learning Systems

Author

Listed:

Xiao Li
Hanchen Xu
Jinming Zhang
(University of Illinois at Urbana-Champaign)
Hua-hua Chang
(Purdue University)

Registered:

Abstract

The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learnerâ€™s latent traits. In this article, we study an important yet less-addressed adaptive learning problemâ€”one that assumes continuous latent traits. Specifically, we formulate the adaptive learning problem as a Markov decision process. We assume latent traits to be continuous with an unknown transition model and apply a model-free deep reinforcement learning algorithmâ€”the deep Q-learning algorithmâ€”that can effectively find the optimal learning policy from data on learnersâ€™ learning process without knowing the actual transition model of the learnersâ€™ continuous latent traits. To efficiently utilize available data, we also develop a transition model estimator that emulates the learnerâ€™s learning process using neural networks. The transition model estimator can be used in the deep Q-learning algorithm so that it can more efficiently discover the optimal learning policy for a learner. Numerical simulation studies verify that the proposed algorithm is very efficient in finding a good learning policy. Especially with the aid of a transition model estimator, it can find the optimal learning policy after training using a small number of learners.

Suggested Citation

Xiao Li & Hanchen Xu & Jinming Zhang & Hua-hua Chang, 2023. "Deep Reinforcement Learning for Adaptive Learning Systems," Journal of Educational and Behavioral Statistics, , vol. 48(2), pages 220-243, April.

Handle: RePEc:sae:jedbes:v:48:y:2023:i:2:p:220-243
DOI: 10.3102/10769986221129847

Download full text from publisher

References listed on IDEAS

Hua-Hua Chang, 2015. "Psychometrics Behind Computerized Adaptive Testing," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 1-20, March.
Geoff Masters, 1982. "A rasch model for partial credit scoring," Psychometrika, Springer;The Psychometric Society, vol. 47(2), pages 149-174, June.
Thomas Warm, 1989. "Weighted likelihood estimation of ability in item response theory," Psychometrika, Springer;The Psychometric Society, vol. 54(3), pages 427-450, September.
Susan Whitely, 1980. "Multicomponent latent trait models for ability tests," Psychometrika, Springer;The Psychometric Society, vol. 45(4), pages 479-494, December.
Chun Wang, 2015. "On Latent Trait Estimation in Multidimensional Compensatory Item Response Models," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 428-449, June.
Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
Jinming Zhang & Minge Xie & Xiaolan Song & Ting Lu, 2011. "Investigating the Impact of Uncertainty About Item Parameters on Ability Estimation," Psychometrika, Springer;The Psychometric Society, vol. 76(1), pages 97-118, January.
Jinming Zhang, 2013. "A Procedure for Dimensionality Analyses of Response Data from Various Test Designs," Psychometrika, Springer;The Psychometric Society, vol. 78(1), pages 37-58, January.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

David Magis & Norman Verhelst, 2017. "On the Finiteness of the Weighted Likelihood Estimator of Ability," Psychometrika, Springer;The Psychometric Society, vol. 82(3), pages 637-647, September.
Janna Niens & Lisa Richter-Beuschel & Tobias C. Stubbe & Susanne Bögeholz, 2021. "Procedural Knowledge of Primary School Teachers in Madagascar for Teaching and Learning towards Land-Use- and Health-Related Sustainable Development Goals," Sustainability, MDPI, vol. 13(16), pages 1-36, August.
Chun Wang, 2015. "On Latent Trait Estimation in Multidimensional Compensatory Item Response Models," Psychometrika, Springer;The Psychometric Society, vol. 80(2), pages 428-449, June.
Marko Böhm & Jan Barkmann & Sabina Eggert & Claus H. Carstensen & Susanne Bögeholz, 2020. "Quantitative Modelling and Perspective Taking: Two Competencies of Decision Making for Sustainable Development," Sustainability, MDPI, vol. 12(17), pages 1-32, August.
Sandip Sinharay, 2015. "The Asymptotic Distribution of Ability Estimates," Journal of Educational and Behavioral Statistics, , vol. 40(5), pages 511-528, October.
David Andrich, 2010. "Sufficiency and Conditional Estimation of Person Parameters in the Polytomous Rasch Model," Psychometrika, Springer;The Psychometric Society, vol. 75(2), pages 292-308, June.
Anders Skrondal & Sophia Rabe-Hesketh, 2022. "The Role of Conditional Likelihoods in Latent Variable Modeling," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 799-834, September.
Georg Gittler & Gerhard Fischer, 2011. "IRT-Based Measurement of Short-Term Changes of Ability, With an Application to Assessing the â€œMozart Effectâ€," Journal of Educational and Behavioral Statistics, , vol. 36(1), pages 33-75, February.
Robert Zwitser & Gunter Maris, 2015. "Conditional Statistical Inference with Multistage Testing Designs," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 65-84, March.
Ogasawara, Haruhiko, 2013. "Asymptotic cumulants of ability estimators using fallible item parameters," Journal of Multivariate Analysis, Elsevier, vol. 119(C), pages 144-162.
David Magis & Gilles RaÃ®che & SÃ©bastien BÃ©land, 2012. "A Didactic Presentation of Snijdersâ€™s lz* Index of Person Fit With Emphasis on Response Model Selection and Ability Estimation," Journal of Educational and Behavioral Statistics, , vol. 37(1), pages 57-81, February.
Fumiko Samejima, 1997. "Departure from normal assumptions: A promise for future psychometrics with substantive mathematical modeling," Psychometrika, Springer;The Psychometric Society, vol. 62(4), pages 471-493, December.
Tabea Feseker & Timo Gnambs & Cordula Artelt, 2021. "Setting a standard for low reading proficiency: A comparison of the bookmark procedure and constrained mixture Rasch model," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-22, November.
Chun Wang & Gongjun Xu & Xue Zhang, 2019. "Correction for Item Response Theory Latent Trait Measurement Error in Linear Mixed Effects Models," Psychometrika, Springer;The Psychometric Society, vol. 84(3), pages 673-700, September.
David Magis, 2015. "A Note on Weighted Likelihood and Jeffreys Modal Estimation of Proficiency Levels in Polytomous Item Response Models," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 200-204, March.
Maxwell Hong & Lizhen Lin & Ying Cheng, 2021. "Asymptotically Corrected Person Fit Statistics for Multidimensional Constructs with Simple Structure and Mixed Item Types," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 464-488, June.
César Merino-Soto & Gina Chávez-Ventura & Verónica López-Fernández & Guillermo M. Chans & Filiberto Toledano-Toledano, 2022. "Learning Self-Regulation Questionnaire (SRQ-L): Psychometric and Measurement Invariance Evidence in Peruvian Undergraduate Students," Sustainability, MDPI, vol. 14(18), pages 1-17, September.
Tulika Saha & Sriparna Saha & Pushpak Bhattacharyya, 2020. "Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-28, July.
Nana Kim & Daniel M. Bolt & James Wollack, 2022. "Noncompensatory MIRT For Passage-Based Tests," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 992-1009, September.
Mahmoud Mahfouz & Angelos Filos & Cyrine Chtourou & Joshua Lockhart & Samuel Assefa & Manuela Veloso & Danilo Mandic & Tucker Balch, 2019. "On the Importance of Opponent Modeling in Auction Markets," Papers 1911.12816, arXiv.org.

More about this item

Keywords

adaptive learning system; transition model estimator; Markov decision process; deep reinforcement learning; deep Q-learning; neural networks; model free;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:jedbes:v:48:y:2023:i:2:p:220-243. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Deep Reinforcement Learning for Adaptive Learning Systems

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data