Scene text recognition is a fundamental step in End-to-End applications where traditional optical character recognition (OCR) systems often fail to produce satisfactory results. This paper proposes a technique that uses co-occurrence histogram of oriented gradients (Co-HOG) to recognize the text in scenes. Compared with histogram of oriented gradients (HOG), Co-HOG is a more powerful tool that captures spatial distribution of neighboring orientation pairs instead of just a single gradient orientation. At the same time, it is more efficient compared with HOG and therefore more suitable for real-time applications. The proposed scene text recognition technique is evaluated on ICDAR2003 character dataset and Street View Text (SVT) dataset. Experiments show that the Co-HOG based technique clearly outperforms state-of-the-art techniques that use HOG, Scale Invariant Feature Transform (SIFT), and Maximally Stable Extremal Regions (MSER).
Shangxuan TianUjjwal BhattacharyaShijian LuBolan SuQingqing WangXiaohua WeiYue LuChew Lim Tan
Zhi Rong TanShangxuan TianChew Lim Tan
Alberto Dzul CalvilloRoberto A. VázquezJosé Ambrosio BastiánAxel Waltier
Nurul HidayatAris TjahyantoXenny Zarvina LovianoMahmud YunusDieky Adzkiya