JOURNAL ARTICLE

Scene Text Detection with Recurrent Instance Segmentation

Abstract

Convolutional Neural Network (CNN) based scene text detection methods mostly employ the semantic segmentation (text/non-text classification) task to localize the regions of texts. However, they cannot distinguish different text-lines like instance segmentation. In this paper, we propose a novel framework based on Fully Convolutional Networks (FCN) and Recurrent Neural Network (RNN) to achieve both scene text detection and instance segmentation. The FCN is used to classify text and non-text regions, and the RNN utilizes the features extracted by FCN to simultaneously detect and segment one text instance at each time step. Meanwhile, it also extracts bounding boxes by a much simpler way than the non-maximum suppression (NMS) method. The proposed method achieves competitive results on two public benchmarks including ICDAR 2015 Incidental Scene Text Dataset and ICDAR 2013 Focused Scene Text Dataset. Moreover, the benefits of adding regression task in the RNN module are manifested.

Keywords:
Computer science Artificial intelligence Segmentation Bounding overwatch Text detection Pattern recognition (psychology) Convolutional neural network Recurrent neural network Task (project management) Image segmentation Text recognition Minimum bounding box Image (mathematics) Artificial neural network

Metrics

3
Cited By
0.00
FWCI (Field Weighted Citation Impact)
43
Refs
0.15
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

TextMountain: Accurate scene text detection via instance segmentation

Yixing ZhuJun Du

Journal:   Pattern Recognition Year: 2020 Vol: 110 Pages: 107336-107336
BOOK-CHAPTER

TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation

Xiaoge SongYirui WuWenhai WangTong Lü

Lecture notes in computer science Year: 2019 Pages: 201-213
JOURNAL ARTICLE

PixelLink: Detecting Scene Text via Instance Segmentation

Dan DengHaifeng LiuXuelong LiDeng Cai

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2018 Vol: 32 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.