IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v6y2021i7p77-d597140.html
   My bibliography  Save this article

Dealing with Randomness and Concept Drift in Large Datasets

Author

Listed:
  • Kassim S. Mwitondi

    (Industry & Innovation Research Institute, College of Business, Technology & Engineering, Sheffield Hallam University, 9410 Cantor Building, City Campus, 153 Arundel Street, Sheffield S1 2NU, UK)

  • Raed A. Said

    (Faculty of Management, Canadian University Dubai, Al Safa Street-Al Wasl, City Walk Mall, Dubai P.O. Box 415053, United Arab Emirates)

Abstract

Data-driven solutions to societal challenges continue to bring new dimensions to our daily lives. For example, while good-quality education is a well-acknowledged foundation of sustainable development, innovation and creativity, variations in student attainment and general performance remain commonplace. Developing data -driven solutions hinges on two fronts-technical and application. The former relates to the modelling perspective, where two of the major challenges are the impact of data randomness and general variations in definitions, typically referred to as concept drift in machine learning. The latter relates to devising data-driven solutions to address real-life challenges such as identifying potential triggers of pedagogical performance, which aligns with the Sustainable Development Goal (SDG) #4-Quality Education. A total of 3145 pedagogical data points were obtained from the central data collection platform for the United Arab Emirates (UAE) Ministry of Education (MoE). Using simple data visualisation and machine learning techniques via a generic algorithm for sampling, measuring and assessing, the paper highlights research pathways for educationists and data scientists to attain unified goals in an interdisciplinary context. Its novelty derives from embedded capacity to address data randomness and concept drift by minimising modelling variations and yielding consistent results across samples. Results show that intricate relationships among data attributes describe the invariant conditions that practitioners in the two overlapping fields of data science and education must identify.

Suggested Citation

  • Kassim S. Mwitondi & Raed A. Said, 2021. "Dealing with Randomness and Concept Drift in Large Datasets," Data, MDPI, vol. 6(7), pages 1-19, July.
  • Handle: RePEc:gam:jdataj:v:6:y:2021:i:7:p:77-:d:597140
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/6/7/77/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/6/7/77/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Robert Lerman, 2019. "Do firms benefit from apprenticeship investments?," IZA World of Labor, Institute of Labor Economics (IZA), pages 1-55, October.
    2. Sarah Randall Johnson & Frances King Stage, 2018. "Academic Engagement and Student Success: Do High-Impact Practices Mean Higher Graduation Rates?," The Journal of Higher Education, Taylor & Francis Journals, vol. 89(5), pages 753-781, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cédric Beaulac & Jeffrey S. Rosenthal, 2019. "Predicting University Students’ Academic Success and Major Using Random Forests," Research in Higher Education, Springer;Association for Institutional Research, vol. 60(7), pages 1048-1064, November.
    2. Dorner, Matthias & Görlitz, Katja, 2020. "Training, wages and a missing school graduation cohort," IAB-Discussion Paper 202028, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    3. Nick Huntington-Klein & Andrew Gill, 2021. "Semester Course Load and Student Performance," Research in Higher Education, Springer;Association for Institutional Research, vol. 62(5), pages 623-650, August.
    4. Werner Eichhorst & Núria Rodríguez-Planas & Ricarda Schmidl & Klaus F. Zimmermann, 2015. "A Road Map to Vocational Education and Training in Industrialized Countries," ILR Review, Cornell University, ILR School, vol. 68(2), pages 314-337, March.
    5. Qian Meng & Qi Zhang, 2023. "The Influence of Academic Self-Efficacy on University Students’ Academic Performance: The Mediating Effect of Academic Engagement," Sustainability, MDPI, vol. 15(7), pages 1-14, March.
    6. Camelia Truta & Luminita Parv & Ioana Topala, 2018. "Academic Engagement and Intention to Drop Out: Levers for Sustainability in Higher Education," Sustainability, MDPI, vol. 10(12), pages 1-11, December.
    7. Annie M. Wofford, 2022. "The Perpetuation of Privilege: Exploring the Relationship Between Early Admissions and High-Impact Practices," Research in Higher Education, Springer;Association for Institutional Research, vol. 63(8), pages 1312-1342, December.
    8. Sofie Cabus & Eszter Nagy, 2021. "On the productivity effects of training apprentices in Hungary: evidence from a unique matched employer–employee dataset," Empirical Economics, Springer, vol. 60(4), pages 1685-1718, April.
    9. Brian P. An & Chad N. Loes, 2023. "Participation in High-Impact Practices: Considering the Role of Institutional Context and a Person-Centered Approach," Research in Higher Education, Springer;Association for Institutional Research, vol. 64(4), pages 520-546, June.
    10. Gaye D. Ceyhan & Alia N. Thompson & Jeremy D. Sloane & Jason R. Wiles & Sule Aksoy & John W. Tillotson, 2019. "The Socialization and Retention of Low-Income College Students: The Impact of a Wrap-Around Intervention," International Journal of Higher Education, Sciedu Press, vol. 8(6), pages 249-249, December.
    11. Sofie Cabus & Eszter Nagy, 2017. "Performance of Hungarian firms: are apprentices an asset or a liability? Evidence from a unique matched employer-employee dataset," Budapest Working Papers on the Labour Market 1706, Institute of Economics, Centre for Economic and Regional Studies.
    12. Rachana Bhatt & Angela Bell & Donald L. Rubin & Coryn Shiflet & Leslie Hodges, 2022. "Education Abroad and College Completion," Research in Higher Education, Springer;Association for Institutional Research, vol. 63(6), pages 987-1014, September.
    13. Lerman, Robert I. & Loprest, Pamela J. & Kuehn, Daniel, 2020. "Training for Jobs of the Future: Improving Access, Certifying Skills, and Expanding Apprenticeship," IZA Policy Papers 166, Institute of Labor Economics (IZA).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:6:y:2021:i:7:p:77-:d:597140. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.