IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i20p3914-d949758.html
   My bibliography  Save this article

Deep-Learning-Based Complex Scene Text Detection Algorithm for Architectural Images

Author

Listed:
  • Weiwei Sun

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

  • Huiqian Wang

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

  • Yi Lu

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

  • Jiasai Luo

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

  • Ting Liu

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

  • Jinzhao Lin

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

  • Yu Pang

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

  • Guo Zhang

    (Chongqing Key Laboratory of Photoelectronic Information Sensing and Transmitting Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
    School of Medical Information and Engineering, Southwest Medical University, Luzhou 646000, China)

Abstract

With the advent of smart cities, the text information in an image can be accurately located and recognized, and then applied to the fields of instant translation, image retrieval, card surface information recognition, and license plate recognition. Thus, people’s lives and work will become more convenient and comfortable. Owing to the varied orientations, angles, and shapes of text, identifying textual features from images is challenging. Therefore, we propose an improved EAST detector algorithm for detecting and recognizing slanted text in images. The proposed algorithm uses reinforcement learning to train a recurrent neural network controller. The optimal fully convolutional neural network structure is selected, and multi-scale features of text are extracted. After importing this information into the output module, the Generalized Intersection over Union algorithm is used to enhance the regression effect of the text bounding box. Next, the loss function is adjusted to ensure a balance between positive and negative sample classes before outputting the improved text detection results. Experimental results indicate that the proposed algorithm can address the problem of category homogenization and improve the low recall rate in target detection. When compared with other image detection algorithms, the proposed algorithm can better identify slanted text in natural scene images. Finally, its ability to recognize text in complex environments is also excellent.

Suggested Citation

  • Weiwei Sun & Huiqian Wang & Yi Lu & Jiasai Luo & Ting Liu & Jinzhao Lin & Yu Pang & Guo Zhang, 2022. "Deep-Learning-Based Complex Scene Text Detection Algorithm for Architectural Images," Mathematics, MDPI, vol. 10(20), pages 1-22, October.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:20:p:3914-:d:949758
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/20/3914/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/20/3914/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fangsheng Wu & Changan Zhu & Jinxiu Xu & Mohammed Wasim Bhatt & Ashutosh Sharma, 2022. "Research on image text recognition based on canny edge detection algorithm and k-means algorithm," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(1), pages 72-80, March.
    2. Julia Diaz-Escobar & Vitaly Kober, 2020. "Natural Scene Text Detection and Segmentation Using Phase-Based Regions and Character Retrieval," Mathematical Problems in Engineering, Hindawi, vol. 2020, pages 1-17, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jibi G. Thanikkal & Ashwani Kumar Dubey & M. T. Thomas, 2023. "A novel edge detection method for medicinal plant's leaf features extraction," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 14(1), pages 448-458, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:20:p:3914-:d:949758. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.