JOURNAL ARTICLE

Towards End-to-End Scene Text Spotting by Sharing Convolutional Feature Map

Abstract

In this paper, we propose a new algorithm that conjointly address scene text detection and recognition by sharing convolutional feature map. Compared with most systems which consider text detection and recognition to be irrelevant tasks, we integrate text detection and recognition into an end-to-end trainable neural network based on convolutional and recurrent neural network It can precisely detect and recognize text during a simple forward propagation, avoiding redundant processes like image patch cropping, repeated calculation of feature map. We train this unified neural network just by images and corresponding ground truth bounding boxes and text labels. Our algorithm gains outstanding performance in terms of computation time and accuracy on standard benchmark datasets. The proposed model runs robustly on multi-ratios images without complicated post-processing steps.

Keywords:
Computer science Convolutional neural network Artificial intelligence End-to-end principle Feature (linguistics) Benchmark (surveying) Spotting Bounding overwatch Pattern recognition (psychology) Computation Feature extraction Algorithm

Metrics

1
Cited By
0.11
FWCI (Field Weighted Citation Impact)
30
Refs
0.49
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.