Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks

Hui Li; Peng Wang; Chunhua Shen

doi:10.1109/iccv.2017.560

ScienceGate Book Chapters

JOURNAL ARTICLE

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks

Hui Li Peng Wang Chunhua Shen

Year: 2017 Pages: 5248-5256

DOI: 10.1109/iccv.2017.560

Get Full-Text PDF Get Analytical Report

Abstract

In this work, we jointly address the problem of text detection and recognition in natural scene images based on convolutional recurrent neural networks. We propose a unified network that simultaneously localizes and recognizes text with a single forward pass, avoiding intermediate processes, such as image cropping, feature re-calculation, word separation, and character grouping. In contrast to existing approaches that consider text detection and recognition as two distinct tasks and tackle them one by one, the proposed framework settles these two tasks concurrently. The whole framework can be trained end-to-end, requiring only images, ground-truth bounding boxes and text labels. The convolutional features are calculated only once and shared by both detection and recognition, which saves processing time. Through multi-task training, the learned features become more informative and improves the overall performance. Our proposed method has achieved competitive performance on several benchmark datasets.

Keywords:

Computer science Convolutional neural network Artificial intelligence Benchmark (surveying) Bounding overwatch Spotting Feature extraction Pattern recognition (psychology) End-to-end principle Task (project management) Feature (linguistics) Text detection Word (group theory) Recurrent neural network Image (mathematics) Artificial neural network

Metrics

235

Cited By

11.95

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vehicle License Plate Recognition

Physical Sciences → Engineering → Media Technology

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Towards Unconstrained End-to-End Text Spotting

Towards End-to-End Scene Text Spotting by Sharing Convolutional Feature Map

Towards End-to-End Gesture Recognition with Recurrent Neural Networks

Towards end‐to‐end likelihood‐free inference with convolutional neural networks

Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting