
Student and school performance across countries: A machine learning approach

Author

Listed:
  • Masci, Chiara
  • Johnes, Geraint
  • Agasisti, Tommaso

Abstract

In this paper, we develop and apply novel machine learning and statistical methods to analyse the determinants of students’ PISA 2015 test scores in nine countries: Australia, Canada, France, Germany, Italy, Japan, Spain, the UK and the USA. The aim is to find out which student characteristics are associated with test scores and which school characteristics are associated with school value-added (measured at school level). A specific aim of our approach is to explore non-linearities in the associations between covariates and test scores, as well as to model interactions between school-level factors in affecting results. To address these issues, we apply a two-stage methodology using flexible tree-based methods. In the first stage, we run multilevel regression trees to estimate school value-added. In the second stage, we relate the estimated school value-added to school-level variables by means of regression trees and boosting. Results show that while several student- and school-level characteristics are significantly associated with students’ achievements, there are marked differences across countries. The proposed approach allows an improved description of the structurally different educational production functions across countries.
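The two-stage idea in the abstract can be illustrated with a small, self-contained sketch. This is not the authors' implementation: it runs on synthetic data, uses one-split trees (stumps) in place of full regression trees, and approximates the multilevel step by alternating between a tree fit and shrunken school-level mean residuals. The helper names (`fit_stump`, `fit_boosting`), the variables `escs` and `resource`, and the shrinkage constant `lam` are all assumptions made for illustration.

```python
import random

def avg(values):
    return sum(values) / len(values)

def fit_stump(X, y):
    """Fit a one-split regression tree (stump) by minimising squared error."""
    best = None
    for j in range(len(X[0])):
        cuts = sorted({row[j] for row in X})
        for lo, hi in zip(cuts, cuts[1:]):
            t = (lo + hi) / 2
            left = [v for row, v in zip(X, y) if row[j] <= t]
            right = [v for row, v in zip(X, y) if row[j] > t]
            ml, mr = avg(left), avg(right)
            sse = sum((v - ml) ** 2 for v in left) + sum((v - mr) ** 2 for v in right)
            if best is None or sse < best[0]:
                best = (sse, j, t, ml, mr)
    if best is None:  # no feature varies: fall back to the mean
        m = avg(y)
        return lambda row: m
    _, j, t, ml, mr = best
    return lambda row: ml if row[j] <= t else mr

def fit_boosting(X, y, rounds=50, lr=0.1):
    """Gradient boosting for squared error, with stumps as base learners."""
    base = avg(y)
    resid = [v - base for v in y]
    stumps = []
    for _ in range(rounds):
        s = fit_stump(X, resid)
        stumps.append(s)
        resid = [r - lr * s(row) for r, row in zip(resid, X)]
    return lambda row: base + lr * sum(s(row) for s in stumps)

# Synthetic data (hypothetical): school value-added jumps where resource > 0.5.
random.seed(0)
n_schools, n_students = 20, 20
resource = [j / (n_schools - 1) for j in range(n_schools)]
true_va = [(10 if r > 0.5 else -10) + random.gauss(0, 2) for r in resource]
students = []  # (school id, socio-economic index, test score)
for j in range(n_schools):
    for _ in range(n_students):
        escs = random.gauss(0, 1)
        score = 480 + 40 * (escs > 0) + true_va[j] + random.gauss(0, 10)
        students.append((j, escs, score))

# Stage 1: alternate between a tree on student covariates and school effects.
X = [[escs] for _, escs, _ in students]
va = [0.0] * n_schools  # estimated school value-added
lam = 5.0               # assumed shrinkage constant, not taken from the paper
for _ in range(3):
    target = [score - va[j] for j, _, score in students]
    tree = fit_stump(X, target)
    resid = [[] for _ in range(n_schools)]
    for (j, _, score), row in zip(students, X):
        resid[j].append(score - tree(row))
    va = [len(r) / (len(r) + lam) * avg(r) for r in resid]

# Stage 2: relate estimated value-added to a school-level variable by boosting.
predict_va = fit_boosting([[r] for r in resource], va)
print(predict_va([0.1]), predict_va([0.9]))
```

In this toy setting the second-stage model can pick up the threshold in `resource` without it being specified in advance, which is the kind of non-linearity the abstract refers to. A real analysis would use full trees, proper mixed-effects estimation, and the PISA sampling design.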

Suggested Citation

  • Masci, Chiara & Johnes, Geraint & Agasisti, Tommaso, 2018. "Student and school performance across countries: A machine learning approach," European Journal of Operational Research, Elsevier, vol. 269(3), pages 1072-1085.
  • Handle: RePEc:eee:ejores:v:269:y:2018:i:3:p:1072-1085
    DOI: 10.1016/j.ejor.2018.02.031

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221718301462
    Download Restriction: Full text for ScienceDirect subscribers only


    Citations


    Cited by:

    1. Bottmer, Lea & Croux, Christophe & Wilms, Ines, 2022. "Sparse regression for large data sets with outliers," European Journal of Operational Research, Elsevier, vol. 297(2), pages 782-794.
    2. Giménez, Víctor & Thieme, Claudio & Prior, Diego & Tortosa-Ausina, Emili, 2022. "Evaluation and determinants of preschool effectiveness in Chile," Socio-Economic Planning Sciences, Elsevier, vol. 81(C).
    3. Antonella D’Agostino & Francesco Schirripa Spagnolo & Nicola Salvati, 2022. "Studying the relationship between anxiety and school achievement: evidence from PISA data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(1), pages 1-20, March.
    4. Yong Shi & Wei Dai & Wen Long & Bo Li, 2021. "Deep Kernel Gaussian Process Based Financial Market Predictions," Papers 2105.12293, arXiv.org.
    5. Rebai, Sonia & Ben Yahia, Fatma & Essid, Hédi, 2020. "A graphically based machine learning approach to predict secondary schools performance in Tunisia," Socio-Economic Planning Sciences, Elsevier, vol. 70(C).
    6. Tsionas, Mike, 2022. "Efficiency estimation using probabilistic regression trees with an application to Chilean manufacturing industries," International Journal of Production Economics, Elsevier, vol. 249(C).
    7. Van Nguyen, Truong & Zhou, Li & Chong, Alain Yee Loong & Li, Boying & Pu, Xiaodie, 2020. "Predicting customer demand for remanufactured products: A data-mining approach," European Journal of Operational Research, Elsevier, vol. 281(3), pages 543-558.
    8. Camanho, Ana S. & Varriale, Luisa & Barbosa, Flávia & Sobral, Thiago, 2021. "Performance assessment of upper secondary schools in Italian regions using a circular pseudo-Malmquist index," European Journal of Operational Research, Elsevier, vol. 289(3), pages 1188-1208.
    9. Alice Bertoletti & Marta Cannistrà & Melisa Diaz Lema & Chiara Masci & Anna Mergoni & Lidia Rossi & Mara Soncin, 2023. "The Determinants of Mathematics Achievement: A Gender Perspective Using Multilevel Random Forest," Economies, MDPI, vol. 11(2), pages 1-20, January.
    10. Joyce de Souza Zanirato Maia & Ana Paula Arantes Bueno & Joao Ricardo Sato, 2023. "Applications of Artificial Intelligence Models in Educational Analytics and Decision Making: A Systematic Review," World, MDPI, vol. 4(2), pages 1-26, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chiara Masci & Francesca Ieva & Tommaso Agasisti & Anna Maria Paganoni, 2021. "Evaluating class and school effects on the joint student achievements in different subjects: a bivariate semiparametric model with random coefficients," Computational Statistics, Springer, vol. 36(4), pages 2337-2377, December.
    2. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2017. "Économétrie et Machine Learning," Papers 1708.06992, arXiv.org, revised Mar 2018.
    3. Eric A. Hanushek, "undated". "The Evidence on Class Size," Wallis Working Papers WP10, University of Rochester - Wallis Institute of Political Economy.
    4. Filmer, Deon P. & Nahata, Vatsal & Sabarwal, Shwetlena, 2021. "Preparation, Practice, and Beliefs: A Machine Learning Approach to Understanding Teacher Effectiveness," Policy Research Working Paper Series 9847, The World Bank.
    5. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2018. "Économétrie & Machine Learning," Working Papers hal-01568851, HAL.
    6. Bernal, Pedro & Mittag, Nikolas & Qureshi, Javaeria A., 2016. "Estimating effects of school quality using multiple proxies," Labour Economics, Elsevier, vol. 39(C), pages 1-10.
    7. Meghir, Costas & Rivkin, Steven, 2011. "Econometric Methods for Research in Education," Handbook of the Economics of Education, in: Erik Hanushek & Stephen Machin & Ludger Woessmann (ed.), Handbook of the Economics of Education, edition 1, volume 3, chapter 1, pages 1-87, Elsevier.
    8. Chen, Shunqin & Guo, Zhengfeng & Zhao, Xinlei, 2021. "Predicting mortgage early delinquency with machine learning methods," European Journal of Operational Research, Elsevier, vol. 290(1), pages 358-372.
    9. Dante Contreras & Daniel Hojman & Manuel Matas & Patricio Rodríguez & Nicolás Suárez, 2018. "The impact of commuting time over educational achievement: A machine learning approach," Working Papers wp472, University of Chile, Department of Economics.
    10. Sophie-Charlotte Klose & Johannes Lederer, 2020. "A Pipeline for Variable Selection and False Discovery Rate Control With an Application in Labor Economics," Papers 2006.12296, arXiv.org, revised Jun 2020.
    11. Barrow, Lisa & Rouse, Cecilia Elena, 2004. "Using market valuation to assess public school spending," Journal of Public Economics, Elsevier, vol. 88(9-10), pages 1747-1769, August.
    12. Maria Iacovou, 2002. "Class Size in the Early Years: Is Smaller Really Better?," Education Economics, Taylor & Francis Journals, vol. 10(3), pages 261-290.
    13. Justin L. Tobias & Mingliang Li, 2003. "A finite-sample hierarchical analysis of wage variation across public high schools: evidence from the NLSY and high school and beyond," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 18(3), pages 315-336.
    14. Akash Malhotra, 2018. "A hybrid econometric-machine learning approach for relative importance analysis: Prioritizing food policy," Papers 1806.04517, arXiv.org, revised Aug 2020.
    15. Gilpin, Gregory A., 2012. "Teacher salaries and teacher aptitude: An analysis using quantile regressions," Economics of Education Review, Elsevier, vol. 31(3), pages 15-29.
    16. Kevin C. Bastian & Gary T. Henry & Charles L. Thompson, 2013. "Incorporating Access to More Effective Teachers into Assessments of Educational Resource Equity," Education Finance and Policy, MIT Press, vol. 8(4), pages 560-580, October.
    17. Michael Bates & Michael Dinerstein & Andrew C. Johnston & Isaac Sorkin, 2022. "Teacher Labor Market Equilibrium and Student Achievement," CESifo Working Paper Series 9551, CESifo.
    18. Pascal Bressoux & Francis Kramarz & Corinne Prost, 2009. "Teachers’ Training, Class Size and Students’ Outcomes: Learning from Administrative Forecasting Mistakes," Economic Journal, Royal Economic Society, vol. 119(536), pages 540-561, March.
    19. Lidia Ceriani & Sergio Olivieri & Marco Ranzani, 2023. "Housing, imputed rent, and household welfare," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 21(1), pages 131-168, March.
    20. Croux, Christophe & Jagtiani, Julapa & Korivi, Tarunsai & Vulanovic, Milos, 2020. "Important factors determining Fintech loan default: Evidence from a lendingclub consumer platform," Journal of Economic Behavior & Organization, Elsevier, vol. 173(C), pages 270-296.
