JOURNAL ARTICLE

Offline text-independent writer identification using a codebook with structural features

Bashar Qasem AhmedYaser F. HassanAshraf S. Elsayed

Year: 2023 Journal:   PLoS ONE Vol: 18 (4)Pages: e0284680-e0284680   Publisher: Public Library of Science

Abstract

Understanding handwritten documents is a vital and challenging problem that attracts many researchers in the fields of forensic and authentication science. This paper presents an offline system for text-independent writer identification of handwritten documents. The system extracts a handwritten connected component contour, which in turn is divided into segments of specific length. The system utilizes the concept of a bag of features in the writer recognition domain and considers handwritten contour segments to extract two conceptually simple and effective structural features. These features are the contour point curve angle and the CONtour point CONcavity/CONvexity. The system uses the proposed features to train a k-means clustering algorithm to construct a codebook of size K . The method then uses occurrence histograms of the extracted features in the codebook to create a final feature vector for each handwritten document. The effectiveness of the proposed features is evaluated in the writer identification domain using two widely used classification methods: the nearest neighbor and the support vector machine techniques. The proposed writer identification is evaluated on two large and public datasets from different language domains, the Arabic KHATT and English IAM datasets. The experimental results show that the proposed system outperforms state-of-the-art methods on the IAM dataset and provides competitive results on the KHATT dataset with respect to the identification rate.

Keywords:
Codebook Computer science Artificial intelligence Pattern recognition (psychology) Identification (biology) Support vector machine Feature (linguistics) Domain (mathematical analysis) Cluster analysis Feature vector Point (geometry) Segmentation Mathematics

Metrics

7
Cited By
1.27
FWCI (Field Weighted Citation Impact)
69
Refs
0.76
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
© 2026 ScienceGate Book Chapters — All rights reserved.