JOURNAL ARTICLE

Scene text spotting based on end-to-end

Guangcun WeiWansheng RongYongquan LiangXinguang XiaoXiang Liu

Year: 2021 Journal:   Journal of Intelligent & Fuzzy Systems Vol: 40 (5)Pages: 8871-8881   Publisher: IOS Press

Abstract

Aiming at the problem that the traditional OCR processing method ignores the inherent connection between the text detection task and the text recognition task, This paper propose a novel end-to-end text spotting framework. The framework includes three parts: shared convolutional feature network, text detector and text recognizer. By sharing convolutional feature network, the text detection network and the text recognition network can be jointly optimized at the same time. On the one hand, it can reduce the computational burden; on the other hand, it can effectively use the inherent connection between text detection and text recognition. This model add the TCM (Text Context Module) on the basis of Mask RCNN, which can effectively solve the negative sample problem in text detection tasks. This paper propose a text recognition model based on the SAM-BiLSTM (spatial attention mechanism with BiLSTM), which can more effectively extract the semantic information between characters. This model significantly surpasses state-of-the-art methods on a number of text detection and text spotting benchmarks, including ICDAR 2015, Total-Text.

Keywords:
Spotting Computer science Text detection Artificial intelligence Context (archaeology) Task (project management) End-to-end principle Pattern recognition (psychology) Feature (linguistics) Connection (principal bundle) Natural language processing Image (mathematics)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
24
Refs
0.02
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
© 2026 ScienceGate Book Chapters — All rights reserved.