JOURNAL ARTICLE

Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields

Xiangdong ZhouDa‐Han WangFeng TianCheng‐Lin LiuMasaki Nakagawa

Year: 2013 Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Vol: 35 (10)Pages: 2413-2426   Publisher: IEEE Computer Society

Abstract

This paper proposes a method for handwritten Chinese/Japanese text (character string) recognition based on semi-Markov conditional random fields (semi-CRFs). The high-order semi-CRF model is defined on a lattice containing all possible segmentation-recognition hypotheses of a string to elegantly fuse the scores of candidate character recognition and the compatibilities of geometric and linguistic contexts by representing them in the feature functions. Based on given models of character recognition and compatibilities, the fusion parameters are optimized by minimizing the negative log-likelihood loss with a margin term on a training string sample set. A forward-backward lattice pruning algorithm is proposed to reduce the computation in training when trigram language models are used, and beam search techniques are investigated to accelerate the decoding speed. We evaluate the performance of the proposed method on unconstrained online handwritten text lines of three databases. On the test sets of databases CASIA-OLHWDB (Chinese) and TUAT Kondate (Japanese), the character level correct rates are 95.20 and 95.44 percent, and the accurate rates are 94.54 and 94.55 percent, respectively. On the test set (online handwritten texts) of ICDAR 2011 Chinese handwriting recognition competition, the proposed method outperforms the best system in competition.

Keywords:
Computer science Pattern recognition (psychology) Artificial intelligence Hidden Markov model Speech recognition Handwriting recognition Feature extraction Intelligent word recognition Chinese characters Test set String (physics) Intelligent character recognition Character recognition Mathematics Image (mathematics)

Metrics

89
Cited By
6.50
FWCI (Field Weighted Citation Impact)
72
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.