IDEAS home Printed from https://ideas.repec.org/p/zbw/vfsc18/181544.html
   My bibliography  Save this paper

Early Detection of Students at Risk - Predicting Student Dropouts Using Administrative Student Data and Machine Learning Methods

Author

Listed:
  • Schneider, Kerstin
  • Berens, Johannes
  • Oster, Simon
  • Burghoff, Julian

Abstract

High rates of student attrition in tertiary education are a major concern for universities and public policy, as dropout is not only costly for the students but also wastes public funds. To successfully reduce student attrition, it is imperative to understand which students are at risk of dropping out and what are the underlying determinants of dropout. We develop an early detection system (EDS) that uses machine learning and classic regression techniques to predict student success in tertiary education as a basis for a targeted intervention. The method developed in this paper is highly standardized and can be easily implemented in every German institution of higher education, as it uses student performance and demographic data collected, stored, and maintained by legal mandate at all German universities and therefore self-adjusts to the university where it is employed. The EDS uses regression analysis and machine learning methods, such as neural networks, decision trees and the AdaBoost algorithm to identify student characteristics which distinguish potential dropouts from graduates. The EDS we present is tested and applied on a medium-sized state university with 23,000 students and a medium-sized private university of applied sciences with 6,700 students. Our results indicate a prediction accuracy at the end of the 1st semester of 79% for the state university and 85% for the private university of applied sciences. Furthermore, accuracy of the EDS increases with each completed semester as new performance data becomes available. After the fourth semester, the accuracy improves to 90% for the state university and 95% for the private university of applied sciences.

Suggested Citation

  • Schneider, Kerstin & Berens, Johannes & Oster, Simon & Burghoff, Julian, 2018. "Early Detection of Students at Risk - Predicting Student Dropouts Using Administrative Student Data and Machine Learning Methods," VfS Annual Conference 2018 (Freiburg, Breisgau): Digital Economy 181544, Verein für Socialpolitik / German Economic Association.
  • Handle: RePEc:zbw:vfsc18:181544
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/181544/1/VfS-2018-pid-13230.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Ralph Stinebrickner & Todd Stinebrickner, 2008. "The Effect of Credit Constraints on the College Drop-Out Decision: A Direct Approach Using a New Panel Study," American Economic Review, American Economic Association, vol. 98(5), pages 2163-2184, December.
    2. Antonio Mucherino & Petraq J. Papajorgji & Panos M. Pardalos, 2009. "Data Mining in Agriculture," Springer Optimization and Its Applications, Springer, number 978-0-387-88615-2, September.
    3. Ralph Stinebrickner & Todd Stinebrickner, 2014. "Academic Performance and College Dropout: Using Longitudinal Expectations Data to Estimate a Learning Model," Journal of Labor Economics, University of Chicago Press, vol. 32(3), pages 601-644.
    4. Tyler Ransom & Esteban Aucejo & Arnaud Maurel & Peter Arcidiacono, 2014. "College Attrition and the Dynamics of Information Revelation," 2014 Meeting Papers 529, Society for Economic Dynamics.
    5. Todd Stinebrickner & Ralph Stinebrickner, 2012. "Learning about Academic Ability and the College Dropout Decision," Journal of Labor Economics, University of Chicago Press, vol. 30(4), pages 707-748.
    6. Ralph Stinebrickner & Todd R. Stinebrickner, 2014. "A Major in Science? Initial Beliefs and Final Outcomes for College Major and Dropout," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(1), pages 426-472.
    7. John Bound & Michael F. Lovenheim & Sarah Turner, 2010. "Why Have College Completion Rates Declined? An Analysis of Changing Student Preparation and Collegiate Resources," American Economic Journal: Applied Economics, American Economic Association, vol. 2(3), pages 129-157, July.
    8. Arulampalam, Wiji & Naylor, Robin A. & Smith, Jeremy P., 2005. "Effects of in-class variation and student rank on the probability of withdrawal: cross-section and time-series analysis for UK university students," Economics of Education Review, Elsevier, vol. 24(3), pages 251-262, June.
    9. Danilowicz-Gösele, Kamila & Meya, Johannes & Schwager, Robert & Suntheim, Katharina, 2014. "Determinants of students' success at university," University of Göttingen Working Papers in Economics 214, University of Goettingen, Department of Economics.
    10. Antonio Mucherino & Petraq J. Papajorgji & Panos M. Pardalos, 2009. "k-Nearest Neighbor Classification," Springer Optimization and Its Applications, in: Data Mining in Agriculture, chapter 0, pages 83-106, Springer.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Behr Andreas & Giese Marco & Teguim K Herve D. & Theune Katja, 2020. "Early Prediction of University Dropouts – A Random Forest Approach," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 240(6), pages 743-789, December.
    2. Contini, Dalit & Salza, Guido, 2020. "Too few university graduates. Inclusiveness and effectiveness of the Italian higher education system," Socio-Economic Planning Sciences, Elsevier, vol. 71(C).
    3. Isphording, Ingo E. & Raabe, Tobias, 2019. "Early Identification of College Dropouts Using Machine-Learning: Conceptual Considerations and an Empirical Example," IZA Research Reports 89, Institute of Labor Economics (IZA).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aina, Carmen & Baici, Eliana & Casalone, Giorgia & Pastore, Francesco, 2022. "The determinants of university dropout: A review of the socio-economic literature," Socio-Economic Planning Sciences, Elsevier, vol. 79(C).
    2. Yifan Gong & Todd Stinebrickner & Ralph Stinebrickner, 2020. "Perceived and actual option values of college enrollment," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 35(7), pages 940-959, November.
    3. Ralph Stinebrickner & Todd Stinebrickner, 2014. "Academic Performance and College Dropout: Using Longitudinal Expectations Data to Estimate a Learning Model," Journal of Labor Economics, University of Chicago Press, vol. 32(3), pages 601-644.
    4. Todd Stinebrickner & Ralph Stinebrickner & Paul Sullivan, 2018. "Job Tasks and the Gender Wage Gap among College Graduates," University of Western Ontario, Centre for Human Capital and Productivity (CHCP) Working Papers 20183, University of Western Ontario, Centre for Human Capital and Productivity (CHCP).
    5. Aina, Carmen & Baici, Eliana & Casalone, Giorgia & Pastore, Francesco, 2018. "The Economics of University Dropouts and Delayed Graduation: A Survey," IZA Discussion Papers 11421, Institute of Labor Economics (IZA).
    6. Ralph Stinebrickner & Todd Stinebrickner & Paul Sullivan, 2019. "Job Tasks, Time Allocation, and Wages," Journal of Labor Economics, University of Chicago Press, vol. 37(2), pages 399-433.
    7. Ralph Stinebrickner & Todd Stinebrickner & Paul Sullivan, 2019. "Beauty, Job Tasks, and Wages: A New Conclusion about Employer Taste-Based Discrimination," The Review of Economics and Statistics, MIT Press, vol. 101(4), pages 602-615, October.
    8. Matsuda, Kazushige, 2020. "Optimal timing of college subsidies: Enrollment, graduation, and the skill premium," European Economic Review, Elsevier, vol. 129(C).
    9. Gong, Yifan & Stinebrickner, Ralph & Stinebrickner, Todd, 2022. "Marriage, children, and labor supply: Beliefs and outcomes," Journal of Econometrics, Elsevier, vol. 231(1), pages 148-164.
    10. Fouarge, Didier & Heß, Pascal, 2023. "Preference-choice mismatch and university dropout," Labour Economics, Elsevier, vol. 83(C).
    11. Emanuela Ghignoni, 2015. "Family background and university dropouts during the crisis: the case of Italy," Working Papers in Public Economics 169, University of Rome La Sapienza, Department of Economics and Law.
    12. Contini,Dalit & Ricciardi,Riccardo & Romito,Marco & Salza,Guido & Zotti,Roberto, 2020. "Improving university dropout and student careers. What room for institutional action?," Department of Economics and Statistics Cognetti de Martiis. Working Papers 202004, University of Turin.
    13. Xavier D'Haultfoeuille & Christophe Gaillac & Arnaud Maurel, 2021. "Rationalizing rational expectations: Characterizations and tests," Quantitative Economics, Econometric Society, vol. 12(3), pages 817-842, July.
    14. Contini, Dalit & Salza, Guido, 2020. "Too few university graduates. Inclusiveness and effectiveness of the Italian higher education system," Socio-Economic Planning Sciences, Elsevier, vol. 71(C).
    15. Giuseppe Bertola, 2023. "University dropout problems and solutions," Journal of Economics, Springer, vol. 138(3), pages 221-248, April.
    16. Michelle Rendall & Andrew Rendall, 2013. "Math Matters: Student Ability, College Majors, and Wage Inequality," 2013 Meeting Papers 1196, Society for Economic Dynamics.
    17. Johannes S. Kunz & Kevin E. Staub, 2016. "Subjective Completion Beliefs and the Demand for Post-Secondary Education," SOEPpapers on Multidisciplinary Panel Data Research 878, DIW Berlin, The German Socio-Economic Panel (SOEP).
    18. Timothy N. Bond & George Bulman & Xiaoxiao Li & Jonathan Smith, 2018. "Updating Human Capital Decisions: Evidence from SAT Score Shocks and College Applications," Journal of Labor Economics, University of Chicago Press, vol. 36(3), pages 807-839.
    19. Backes-Gellner, Uschi & Herz, Holger & Kosfeld, Michael & Oswald, Yvonne, 2021. "Do preferences and biases predict life outcomes? Evidence from education and labor market entry decisions," European Economic Review, Elsevier, vol. 134(C).
    20. Buly A. Cardak & Joe Vecci, 2016. "Graduates, Dropouts and Slow Finishers: The Effects of Credit Constraints on University Outcomes," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 78(3), pages 323-346, June.

    More about this item

    Keywords

    student attrition; early detection; administrative data; higher education; machine learning; AdaBoost;
    All these keywords.

    JEL classification:

    • I23 - Health, Education, and Welfare - - Education - - - Higher Education; Research Institutions
    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • H52 - Public Economics - - National Government Expenditures and Related Policies - - - Government Expenditures and Education

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:vfsc18:181544. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/vfsocea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.