Printed from https://ideas.repec.org/a/gam/jsusta/v14y2022i19p12479-d930463.html

A Comparative Study of Engraved-Digit Data Augmentation by Generative Adversarial Networks

Author

Listed:
  • Abdulkabir Abdulraheem

    (School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Korea)

  • Im Y. Jung

    (School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Korea)

Abstract

An efficient information retrieval (IR) system that retrieves information from images of engraved digits, as found on medicines, creams, ointments, and gels in squeeze tubes, needs to be trained on a large dataset. One application of such a system is to automatically retrieve the expiry date to ascertain the efficacy of the medicine. However, images of expiry dates expressed in engraved digits are difficult to collect. In our study, we evaluated the augmentation performance of various generative adversarial networks (GANs) on a limited engraved-digit dataset. Our study contributes to the choice of an effective GAN for engraved-digit image data augmentation. We conclude that Wasserstein GAN with a gradient norm penalty (WGAN-GP) is a suitable data augmentation technique for producing a large, realistic synthetic dataset. Our results show that the stability of WGAN-GP aids in producing high-quality data that are nearly indistinguishable from our original dataset, with an average Fréchet inception distance (FID) of 1.5298 across images of the 10 digits (0–9).
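
The FID quoted in the abstract is a Fréchet distance between two Gaussians fitted to feature vectors of real and generated images: d^2 = ||mu_1 - mu_2||^2 + Tr(Sigma_1 + Sigma_2 - 2 (Sigma_1 Sigma_2)^(1/2)). The sketch below is not the authors' evaluation pipeline. It is a minimal illustration that assumes diagonal covariances, so the trace term reduces to a per-dimension sum and only the standard library is needed. A full FID would use Inception-v3 features and full covariance matrices with a matrix square root. The function name and the toy feature vectors are hypothetical.

```python
import math
from statistics import fmean, pvariance


def frechet_distance_diag(real_feats, fake_feats):
    """Squared Frechet distance between two Gaussians with diagonal covariance.

    ASSUMPTION: covariances are diagonal, so for each feature dimension the
    contribution is (mu_r - mu_f)^2 + (sigma_r - sigma_f)^2.
    Population variance is used here for simplicity; FID implementations
    usually estimate a full sample covariance matrix instead.
    """
    dims = len(real_feats[0])
    total = 0.0
    for d in range(dims):
        r = [row[d] for row in real_feats]
        f = [row[d] for row in fake_feats]
        mu_r, mu_f = fmean(r), fmean(f)
        sd_r, sd_f = math.sqrt(pvariance(r)), math.sqrt(pvariance(f))
        total += (mu_r - mu_f) ** 2 + (sd_r - sd_f) ** 2
    return total


# Hypothetical 2-D "features" standing in for real and generated images.
real = [[0.0, 1.0], [2.0, 3.0]]
fake = [[1.0, 1.0], [3.0, 5.0]]
score = frechet_distance_diag(real, fake)
print(score)
```

A lower value means the two feature distributions are closer, and identical feature sets give a distance of zero.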

Suggested Citation

  • Abdulkabir Abdulraheem & Im Y. Jung, 2022. "A Comparative Study of Engraved-Digit Data Augmentation by Generative Adversarial Networks," Sustainability, MDPI, vol. 14(19), pages 1-14, September.
  • Handle: RePEc:gam:jsusta:v:14:y:2022:i:19:p:12479-:d:930463

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/14/19/12479/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/14/19/12479/
    Download Restriction: no

    References listed on IDEAS

    1. Dowson, D. C. & Landau, B. V., 1982. "The Fréchet distance between multivariate normal distributions," Journal of Multivariate Analysis, Elsevier, vol. 12(3), pages 450-455, September.

    Citations



    Cited by:

    1. Abdulkabir Abdulraheem & Im Y. Jung, 2023. "Effective Digital Technology Enabling Automatic Recognition of Special-Type Marking of Expiry Dates," Sustainability, MDPI, vol. 15(17), pages 1-22, August.
    2. Abdulkabir Abdulraheem & Jamiu T. Suleiman & Im Y. Jung, 2023. "Enhancing the Automatic Recognition Accuracy of Imprinted Ship Characters by Using Machine Learning," Sustainability, MDPI, vol. 15(19), pages 1-20, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Elham Yousefi & Luc Pronzato & Markus Hainy & Werner G. Müller & Henry P. Wynn, 2023. "Discrimination between Gaussian process models: active learning and static constructions," Statistical Papers, Springer, vol. 64(4), pages 1275-1304, August.
    2. Knott, Martin & Smith, Cyril, 2006. "Choosing joint distributions so that the variance of the sum is small," Journal of Multivariate Analysis, Elsevier, vol. 97(8), pages 1757-1765, September.
    3. Rippl, Thomas & Munk, Axel & Sturm, Anja, 2016. "Limit laws of the empirical Wasserstein distance: Gaussian distributions," Journal of Multivariate Analysis, Elsevier, vol. 151(C), pages 90-109.
    4. Zhongzhi Lawrence He, 2018. "Comparing Asset Pricing Models: Distance-based Metrics and Bayesian Interpretations," Papers 1803.01389, arXiv.org.
    5. Mordant, Gilles & Segers, Johan, 2022. "Measuring dependence between random vectors via optimal transport," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    6. Whiteley, Nick, 2021. "Dimension-free Wasserstein contraction of nonlinear filters," Stochastic Processes and their Applications, Elsevier, vol. 135(C), pages 31-50.
    7. Ledoit, Olivier & Wolf, Michael, 2021. "Shrinkage estimation of large covariance matrices: Keep it simple, statistician?," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    8. Puccetti, Giovanni & Rüschendorf, Ludger & Vanduffel, Steven, 2020. "On the computation of Wasserstein barycenters," Journal of Multivariate Analysis, Elsevier, vol. 176(C).
    9. Nabil Kahalé, 2019. "Efficient Simulation of High Dimensional Gaussian Vectors," Mathematics of Operations Research, INFORMS, vol. 44(1), pages 58-73, February.
    10. Artur Karimov & Ekaterina Kopets & Tatiana Shpilevaya & Evgenii Katser & Sergey Leonov & Denis Butusov, 2023. "Comparing Neural Style Transfer and Gradient-Based Algorithms in Brushstroke Rendering Tasks," Mathematics, MDPI, vol. 11(10), pages 1-30, May.
    11. Xu, Ganggang & Zhu, Huirong & Lee, J. Jack, 2020. "Borrowing strength and borrowing index for Bayesian hierarchical models," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    12. Olivier Ledoit & Michael Wolf, 2019. "Shrinkage estimation of large covariance matrices: keep it simple, statistician?," ECON - Working Papers 327, Department of Economics - University of Zurich, revised Jun 2021.
    13. Zhongzhi Lawrence He, 2018. "Generalized Information Ratio," Papers 1803.01381, arXiv.org, revised Apr 2018.
