IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0344262.html

Toxic chinese herbal medicine recognition in real-world images via multi-scale and attention-enhanced EfficientNetV2

Author

Listed:
  • Guohua Zhu
  • Jaehan Joo
  • Seonghyeon Park
  • Suk Chan Kim

Abstract

Accurate identification of toxic Chinese herbal medicines is critical for public health and clinical safety. However, real-world herbal images often exhibit complex backgrounds and small, indistinct target regions, posing substantial challenges to automated classification systems. In this study, we present a novel image dataset comprising over 4,000 samples from 47 toxic herb categories, captured under diverse environmental conditions to reflect real-world variability. We benchmark several state-of-the-art convolutional neural networks, including ResNet, ResNeXt, and EfficientNet variants, and identify EfficientNetV2 as the most effective baseline. To further enhance model robustness and discriminative capability, we propose an improved EfficientNetV2 architecture incorporating two lightweight yet effective modules: a Multi-Scale Feature Fusion (MSFF) module to integrate hierarchical features, and a Convolutional Block Attention Module (CBAM) to refine both spatial and channel-wise representations. Experimental results demonstrate that our enhanced model achieves 91.28% Top-1 accuracy, 97.52% Top-5 accuracy, and a 90.27% macro F1-score, significantly outperforming baseline methods. Ablation studies confirm the complementary benefits of MSFF and CBAM, and targeted evaluations on challenging image subsets reveal improved resilience to background clutter and small object localization. The proposed architecture offers a high-accuracy, generalizable, and computationally efficient solution for toxic herbal medicine classification and provides a valuable reference for intelligent traditional medicine recognition applications. The GitHub repository for this project is available at: https://github.com/zhuguohua1992/Toxic-chinese-herbal-medicine-recognition-via-enhanced-EfficientNetV2.

Suggested Citation

  • Guohua Zhu & Jaehan Joo & Seonghyeon Park & Suk Chan Kim, 2026. "Toxic chinese herbal medicine recognition in real-world images via multi-scale and attention-enhanced EfficientNetV2," PLOS ONE, Public Library of Science, vol. 21(3), pages 1-16, March.
  • Handle: RePEc:plo:pone00:0344262
    DOI: 10.1371/journal.pone.0344262
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0344262
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0344262&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0344262?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0344262. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.