A Modified Method for Scene Text Detection by ResNet

Shaozhang Niu; Xiangxiang Li; Maosen Wang; Yueying Li

doi:10.32604/cmc.2020.09471

Open Access icon Open Access

ARTICLE

A Modified Method for Scene Text Detection by ResNet

Shaozhang Niu^{1, *}, Xiangxiang Li¹, Maosen Wang¹, Yueying Li²

1 School of Computer Science and Technology, Beijing University of Posts and Telecommunications, Beijing, 100876, China.
2 College of Arts and Sciences, Boston University, Boston University, Boston, 02215, USA.

* Corresponding Author: Shaozhang Niu. Email: email .

Computers, Materials & Continua 2020, 65(3), 2233-2245. https://doi.org/10.32604/cmc.2020.09471

Received 18 December 2019; Accepted 21 June 2020; Issue published 16 September 2020

Download PDF

Abstract

In recent years, images have played a more and more important role in our daily life and social communication. To some extent, the textual information contained in the pictures is an important factor in understanding the content of the scenes themselves. The more accurate the text detection of the natural scenes is, the more accurate our semantic understanding of the images will be. Thus, scene text detection has also become the hot spot in the domain of computer vision. In this paper, we have presented a modified text detection network which is based on further research and improvement of Connectionist Text Proposal Network (CTPN) proposed by previous researchers. To extract deeper features that are less affected by different images, we use Residual Network (ResNet) to replace Visual Geometry Group Network (VGGNet) which is used in the original network. Meanwhile, to enhance the robustness of the models to multiple languages, we use the datasets for training from multi-lingual scene text detection and script identification datasets (MLT) of 2017 International Conference on Document Analysis and Recognition (ICDAR2017). And apart from that, the attention mechanism is used to get more reasonable weight distribution. We found the proposed models achieve 0.91 F1-score on ICDAR2011 test, better than CTPN trained on the same datasets by about 5%.

Keywords

CTPN, scene text detection, ResNet, attention.

Cite This Article

APA Style

Niu, S., Li, X., Wang, M., Li, Y. (2020). A modified method for scene text detection by resnet. Computers, Materials & Continua, 65(3), 2233-2245. https://doi.org/10.32604/cmc.2020.09471

Vancouver Style

Niu S, Li X, Wang M, Li Y. A modified method for scene text detection by resnet. Comput Mater Contin. 2020;65(3):2233-2245 https://doi.org/10.32604/cmc.2020.09471

IEEE Style

S. Niu, X. Li, M. Wang, and Y. Li, “A Modified Method for Scene Text Detection by ResNet,” Comput. Mater. Contin., vol. 65, no. 3, pp. 2233-2245, 2020. https://doi.org/10.32604/cmc.2020.09471

BibTex EndNote RIS

Copyright © 2020 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

A Modified Method for Scene Text Detection by ResNet

Abstract

Keywords

Cite This Article

2279

1523

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link