IDEAS home Printed from https://ideas.repec.org/p/fip/fedgfe/102367.html

LLM on a Budget: Active Knowledge Distillation for Efficient Classification of Large Text Corpora

Author

Listed:

Abstract

Large Language Models (LLMs) are highly accurate in classification tasks, however, substantial computational and financial costs hinder their large-scale deployment in dynamic environments. Knowledge Distillation (KD) where a LLM ""teacher"" trains a smaller and more efficient ""student"" model, offers a promising solution to this problem. However, the distillation process itself often remains costly for large datasets, since it requires the teacher to label a vast number of samples while incurring significant token consumption. To alleviate this challenge, in this work we explore the active learning (AL) as a way to create efficient student models at a fraction of the cost while preserving the LLM's performance. In particular, we introduce M-RARU (Multi-class Randomized Accept/Reject Uncertainty Sampling), a novel AL algorithm that significantly reduces training costs. M-RARU employs an innovative strategy combining uncertainty with a randomized accept-reject mechanism to select only the most informative data points for the LLM teacher. This focused approach significantly minimizes required API calls and data processing time. We evaluate M-RARU against random sampling across five diverse student models (SVM, LDA, RF, GBDT, and DistilBERT) on multiple benchmark datasets. Experiments demonstrate that our proposed method achieves up to 80\% reduction in sample requirements as compared to random sampling, substantially improving classification accuracy while reducing financial costs and overall training time.

Suggested Citation

  • Leland D. Crane & Xiaoyu Ge & Flora Haberkorn & Rithika Iyengar & Seung Jung Lee & Viviana Luccioli & Ryan Panley & Nitish R. Sinha, 2025. "LLM on a Budget: Active Knowledge Distillation for Efficient Classification of Large Text Corpora," Finance and Economics Discussion Series 2025-108, Board of Governors of the Federal Reserve System (U.S.).
  • Handle: RePEc:fip:fedgfe:102367
    DOI: 10.17016/FEDS.2025.108
    as

    Download full text from publisher

    File URL: https://www.federalreserve.gov/econres/feds/files/2025108pap.pdf
    File Function: Full text
    Download Restriction: no

    File URL: https://libkey.io/10.17016/FEDS.2025.108?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Fan, Cheng & Wu, Qiuting & Zhao, Yang & Mo, Like, 2024. "Integrating active learning and semi-supervised learning for improved data-driven HVAC fault diagnosis performance," Applied Energy, Elsevier, vol. 356(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Deng, Qiao & Liu, Yufei & Chen, Zhiwen & Zhu, Wanting & Wang, Yalin & Gui, Weihua, 2025. "A novel physical information-guided predictive maintenance method for chillers," Applied Energy, Elsevier, vol. 402(PA).
    2. Fan, Cheng & Chen, Ruikun & Mo, Jinhan & Liao, Longhui, 2024. "Personalized federated learning for cross-building energy knowledge sharing: Cost-effective strategies and model architectures," Applied Energy, Elsevier, vol. 362(C).
    3. Cho, Yongjun & Kim, Donghoon & Kim, Jinho, 2026. "Data-driven demand response aggregation for public EV charging stations: Overcoming decoupled governance challenges," Applied Energy, Elsevier, vol. 402(PB).
    4. Wang, Zhanwei & Qin, Yijie & Kong, Yifan & Wang, Lin & Leng, Qiang & Zhang, Chunxiao, 2025. "Advanced fault detection, diagnosis and prognosis in HVAC systems: Lifecycle insight, key challenges, and promising approaches," Renewable and Sustainable Energy Reviews, Elsevier, vol. 219(C).
    5. Yan, Ke & Bi, Jian & Wang, Hua & Gao, Yuan & Afshari, Afshin, 2025. "A stable, reliable and interpretable diffusion model for HVAC FDD with data unavailability," Applied Energy, Elsevier, vol. 401(PC).
    6. Yan, Ke & He, Changfu & Wang, Chuan & Gao, Yuan & Du, Yang & Afshari, Afshin, 2026. "A few-shot learning framework for HVAC fault diagnosis in data centers with minimal data required," Applied Energy, Elsevier, vol. 402(PC).
    7. Zhang, Boyan & Wang, Jiaming & Rezgui, Yacine & Zhao, Tianyi, 2025. "Enhancing the generalizability of public building energy system fault detection method: A research on unknown multi-source fault detection and diagnosis method based on data-driven heuristic reasoning (DHR)," Energy, Elsevier, vol. 335(C).
    8. Zhang, Jiahao & Peng, Ruo & Lu, Chenbei & Wu, Chenye, 2025. "Computationally efficient data synthesis for AC-OPF: Integrating Physics-Informed Neural Network solvers and active learning," Applied Energy, Elsevier, vol. 378(PA).
    9. Zhang, Jian & Zhang, Chaobo & Lu, Jie & Zhao, Yang, 2025. "Domain-specific large language models for fault diagnosis of heating, ventilation, and air conditioning systems by labeled-data-supervised fine-tuning," Applied Energy, Elsevier, vol. 377(PA).
    10. Li, Guolong & Li, Yanjun & Su, Jian & Wang, Haotong & Sun, Shengdi & Zhao, Jiarui & Zhang, Guolei & Shi, Jianxin, 2025. "Fault diagnosis for supercharged boiler based on self-improving few-shot learning," Energy, Elsevier, vol. 316(C).
    11. Wenhao Lu & Wei Wang & Xuefei Qin & Zhiqiang Cai, 2024. "Enhancing Fault Diagnosis in Mechanical Systems with Graph Neural Networks Addressing Class Imbalance," Mathematics, MDPI, vol. 12(13), pages 1-22, July.

    More about this item

    Keywords

    ;
    ;
    ;

    JEL classification:

    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:fip:fedgfe:102367. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Ryan Wolfslayer ; Keisha Fournillier (email available below). General contact details of provider: https://edirc.repec.org/data/frbgvus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.