IDEAS home Printed from https://ideas.repec.org/a/gam/jforec/v3y2021i3p33-540d597969.html
   My bibliography  Save this article

Attention-Based CNN-RNN Arabic Text Recognition from Natural Scene Images

Author

Listed:
  • Hanan Butt

    (Department of Computer Science, COMSATS University Islamabad, Islamabad 45550, Pakistan)

  • Muhammad Raheel Raza

    (Department of Software Engineering, College of Technology, Firat University, 23000 Elazig, Turkey)

  • Muhammad Javed Ramzan

    (Department of Computer Science, COMSATS University Islamabad, Islamabad 45550, Pakistan)

  • Muhammad Junaid Ali

    (Department of Computer Science, COMSATS University Islamabad, Islamabad 45550, Pakistan)

  • Muhammad Haris

    (Department of Computer Science, COMSATS University Islamabad, Islamabad 45550, Pakistan)

Abstract

According to statistics, there are 422 million speakers of the Arabic language. Islam is the second-largest religion in the world, and its followers constitute approximately 25% of the world’s population. Since the Holy Quran is in Arabic, nearly all Muslims understand the Arabic language per some analytical information. Many countries have Arabic as their native and official language as well. In recent years, the number of internet users speaking the Arabic language has been increased, but there is very little work on it due to some complications. It is challenging to build a robust recognition system (RS) for cursive nature languages such as Arabic. These challenges become more complex if there are variations in text size, fonts, colors, orientation, lighting conditions, noise within a dataset, etc. To deal with them, deep learning models show noticeable results on data modeling and can handle large datasets. Convolutional neural networks (CNNs) and recurrent neural networks (RNNs) can select good features and follow the sequential data learning technique. These two neural networks offer impressive results in many research areas such as text recognition, voice recognition, several tasks of Natural Language Processing (NLP), and others. This paper presents a CNN-RNN model with an attention mechanism for Arabic image text recognition. The model takes an input image and generates feature sequences through a CNN. These sequences are transferred to a bidirectional RNN to obtain feature sequences in order. The bidirectional RNN can miss some preprocessing of text segmentation. Therefore, a bidirectional RNN with an attention mechanism is used to generate output, enabling the model to select relevant information from the feature sequences. An attention mechanism implements end-to-end training through a standard backpropagation algorithm.

Suggested Citation

  • Hanan Butt & Muhammad Raheel Raza & Muhammad Javed Ramzan & Muhammad Junaid Ali & Muhammad Haris, 2021. "Attention-Based CNN-RNN Arabic Text Recognition from Natural Scene Images," Forecasting, MDPI, vol. 3(3), pages 1-21, July.
  • Handle: RePEc:gam:jforec:v:3:y:2021:i:3:p:33-540:d:597969
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-9394/3/3/33/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-9394/3/3/33/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Amani Aldahiri & Bashair Alrashed & Walayat Hussain, 2021. "Trends in Using IoT with Machine Learning in Health Prediction System," Forecasting, MDPI, vol. 3(1), pages 1-26, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alessandro Niccolai & Seyedamir Orooji & Andrea Matteri & Emanuele Ogliari & Sonia Leva, 2022. "Irradiance Nowcasting by Means of Deep-Learning Analysis of Infrared Images," Forecasting, MDPI, vol. 4(1), pages 1-11, March.
    2. Walayat Hussain & Asma Musabah Alkalbani & Honghao Gao, 2021. "Forecasting with Machine Learning Techniques," Forecasting, MDPI, vol. 3(4), pages 1-2, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Walayat Hussain & Asma Musabah Alkalbani & Honghao Gao, 2021. "Forecasting with Machine Learning Techniques," Forecasting, MDPI, vol. 3(4), pages 1-2, November.
    2. Carolina Del-Valle-Soto & Leonardo J. Valdivia & Juan Carlos López-Pimentel & Paolo Visconti, 2023. "Comparison of Collaborative and Cooperative Schemes in Sensor Networks for Non-Invasive Monitoring of People at Home," IJERPH, MDPI, vol. 20(7), pages 1-22, March.
    3. Xingxing Zong & Lian Wang & Qingyuan Xie & Mariusz Lipowski, 2022. "The Influence of Psychological Distance on the Challenging Moral Decision Support of Sports Majors in Internet of Things and Machine Learning," Sustainability, MDPI, vol. 14(19), pages 1-14, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jforec:v:3:y:2021:i:3:p:33-540:d:597969. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.