JOURNAL ARTICLE

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks

Abstract

In this work, we jointly address the problem of text detection and recognition in natural scene images based on convolutional recurrent neural networks. We propose a unified network that simultaneously localizes and recognizes text with a single forward pass, avoiding intermediate processes, such as image cropping, feature re-calculation, word separation, and character grouping. In contrast to existing approaches that consider text detection and recognition as two distinct tasks and tackle them one by one, the proposed framework settles these two tasks concurrently. The whole framework can be trained end-to-end, requiring only images, ground-truth bounding boxes and text labels. The convolutional features are calculated only once and shared by both detection and recognition, which saves processing time. Through multi-task training, the learned features become more informative and improves the overall performance. Our proposed method has achieved competitive performance on several benchmark datasets.

Keywords:
Computer science Convolutional neural network Artificial intelligence Benchmark (surveying) Bounding overwatch Spotting Feature extraction Pattern recognition (psychology) End-to-end principle Task (project management) Feature (linguistics) Text detection Word (group theory) Recurrent neural network Image (mathematics) Artificial neural network

Metrics

235
Cited By
11.95
FWCI (Field Weighted Citation Impact)
53
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Towards End-to-End Gesture Recognition with Recurrent Neural Networks

Tong DuXuemei Ren

Lecture notes in electrical engineering Year: 2018 Pages: 145-153
JOURNAL ARTICLE

Towards end‐to‐end likelihood‐free inference with convolutional neural networks

Stefan T. RadevUlf K. MertensAndreas VoßUllrich Köthe

Journal:   British Journal of Mathematical and Statistical Psychology Year: 2019 Vol: 73 (1)Pages: 23-43
JOURNAL ARTICLE

Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting

Qiao LiangSanli TangZhanzhan ChengYunlu XuYi NiuShiliang PuFei Wu

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2020 Vol: 34 (07)Pages: 11899-11907
© 2026 ScienceGate Book Chapters — All rights reserved.