IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v11y2026i3p57-d1895927.html

Leveraging Transformers and LLMs for Automated Grading and Feedback Generation Using a Novel Dataset

Author

Listed:
  • Asmaa G. Khalf

    (Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef 62511, Egypt)

  • Emad Nabil

    (Faculty of Computer and Information Systems, Islamic University of Madinah, Madinah 42351, Saudi Arabia)

  • Wael H. Gomaa

    (Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef 62511, Egypt)

  • Oussama Benrhouma

    (Faculty of Computer and Information Systems, Islamic University of Madinah, Madinah 42351, Saudi Arabia)

  • Amira M. El-Mandouh

    (Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef 62511, Egypt)

Abstract

Automated Short Answer Grading (ASAG) has garnered significant attention in the field of educational technology due to its potential to improve the efficiency, scalability, and consistency of student assessments. This study introduces a novel dataset of 651 student responses from a Database Transaction course exam at Beni-Suef University, referred to as the Beni-Suef Transaction Processing (BeSTraP) dataset. The BeSTraP is specifically designed to support ASAG evaluation. To assess ASAG performance, five approaches were employed: string-based similarity, semantic similarity, a hybrid of both, fine-tuning transformer-based models, and the application of Large Language Models (LLMs). The experimental results indicated that fine-tuned transformers, particularly GPT-2, achieved the highest Pearson correlation with human scores (0.8813) on the new dataset and maintained robust performance on the Mohler benchmark (0.7834). In addition to grading, the framework integrates automated feedback generation through LLMs, further enriching the assessment process. This research contributes (i) a novel, domain-specific dataset derived from an actual university examination, (ii) a comprehensive comparison of traditional and transformer-based approaches, and (iii) evidence of the efficacy of fine-tuned models in providing accurate and scalable grading solutions. The created dataset will be publicly available for the community.

Suggested Citation

  • Asmaa G. Khalf & Emad Nabil & Wael H. Gomaa & Oussama Benrhouma & Amira M. El-Mandouh, 2026. "Leveraging Transformers and LLMs for Automated Grading and Feedback Generation Using a Novel Dataset," Data, MDPI, vol. 11(3), pages 1-32, March.
  • Handle: RePEc:gam:jdataj:v:11:y:2026:i:3:p:57-:d:1895927
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/11/3/57/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/11/3/57/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:11:y:2026:i:3:p:57-:d:1895927. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager The email address of this maintainer does not seem to be valid anymore. Please ask MDPI Indexing Manager to update the entry or send us the correct address (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.