This paper aims to study the problem of text recognition in complex natural scenes. Now the rapid development of technology makes most of the industry need fast and efficient text recognition tools, such as the logistics industry and publishers. Although traditional OCR technology can solve most simple background text recognition problems, the performance of complex scenes is somewhat unsatisfactory. In this paper, the focus is to extract the text in complex scenes accurately. The algorithm of stroke width transform and Convolution Neural Network is proposed. Using the stroke width transform algorithm for text positioning; And then separate the text line, end to the classifier to identify, through the recognition of the correct rate determine the effectiveness of the algorithm. We evaluate the proposed approach on a widely used dataset. Results show that our method achieves desired result compared with the state-of-the-art methods of the participant team which clearly demonstrate its competency.
Xiaoming HuangTao ShenRun WangChenqiang Gao