IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v12y2020i11p200-d446056.html
   My bibliography  Save this article

AT-Text: Assembling Text Components for Efficient Dense Scene Text Detection

Author

Listed:
  • Haiyan Li

    (Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
    School of Computer Science and Technology, Kashi University, Kashi 844000, China)

  • Hongtao Lu

    (Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China)

Abstract

Text detection is a prerequisite for text recognition in scene images. Previous segmentation-based methods for detecting scene text have already achieved a promising performance. However, these kinds of approaches may produce spurious text instances, as they usually confuse the boundary of dense text instances, and then infer word/text line instances relying heavily on meticulous heuristic rules. We propose a novel Assembling Text Components (AT-text) that accurately detects dense text in scene images. The AT-text localizes word/text line instances in a bottom-up mechanism by assembling a parsimonious component set. We employ a segmentation model that encodes multi-scale text features, considerably improving the classification accuracy of text/non-text pixels. The text candidate components are finely classified and selected via discriminate segmentation results. This allows the AT-text to efficiently filter out false-positive candidate components, and then to assemble the remaining text components into different text instances. The AT-text works well on multi-oriented and multi-language text without complex post-processing and character-level annotation. Compared with the existing works, it achieves satisfactory results and a considerable balance between precision and recall without a large margin in ICDAR2013 and MSRA-TD 500 public benchmark datasets.

Suggested Citation

  • Haiyan Li & Hongtao Lu, 2020. "AT-Text: Assembling Text Components for Efficient Dense Scene Text Detection," Future Internet, MDPI, vol. 12(11), pages 1-14, November.
  • Handle: RePEc:gam:jftint:v:12:y:2020:i:11:p:200-:d:446056
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/12/11/200/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/12/11/200/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:12:y:2020:i:11:p:200-:d:446056. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.