Automated system for Arabic optical character recognition

Inad Aljarrah; Osama Al-Khaleel; Khaldoon Mhaidat; Mu'ath Alrefai; Abdullah Alzu'bi; Mohammad A.A. Al Rababah

doi:10.1145/2222444.2222449

JOURNAL ARTICLE

Year: 2012 Pages: 1-6

Abstract

This paper implements an Arabic Optical Character Recognition (OCR) system. The system takes a scanned image of Arabic text as input and generates editable text from it. It first segments the document image into lines, then segments each line into words, and each word into sub-words. Each word or sub-word is then segmented into individual characters, and a feature extraction process is applied to each character to compute its feature vector. The feature vector is compared with template feature vectors for each letter of the Arabic alphabet and its variations. A minimum distance classifier is used in the classification stage. Promising results are achieved even though Arabic OCR is considered many times harder than OCR for languages such as English, because the letters within a word are connected.
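The classification stage described in the abstract can be illustrated with a minimal sketch of a minimum distance classifier. This is not the authors' implementation: the abstract does not state which features or distance metric are used, so the Euclidean distance, the three-element feature vectors, and the names `euclidean`, `classify`, and `TEMPLATES` are all assumptions made for illustration.

```python
import math
from typing import Dict, List, Sequence, Tuple


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify(
    features: Sequence[float],
    templates: Dict[str, List[Sequence[float]]],
) -> Tuple[str, float]:
    """Return the label of the nearest template and its distance.

    Each label maps to one or more template vectors, so that the
    variations of a letter can each have their own template.
    """
    best_label = None
    best_dist = math.inf
    for label, variants in templates.items():
        for template in variants:
            dist = euclidean(features, template)
            if dist < best_dist:
                best_label, best_dist = label, dist
    if best_label is None:
        raise ValueError("no templates supplied")
    return best_label, best_dist


# Hypothetical toy templates; real feature vectors would come from the
# feature extraction step, which the abstract does not specify.
TEMPLATES = {
    "alef": [[0.1, 0.9, 0.0]],
    "beh": [[0.8, 0.2, 1.0], [0.7, 0.3, 1.0]],  # two variations of one letter
}

label, distance = classify([0.75, 0.25, 1.0], TEMPLATES)
print(label, round(distance, 4))
```

Storing several template vectors per label is one simple way to account for letter variations; other distance measures could be substituted in `euclidean` without changing the rest of the sketch.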

Keywords:

Computer science; Artificial intelligence; Optical character recognition; Character (mathematics); Word (group theory); Feature extraction; Arabic; Alphabet; Classifier (UML); Pattern recognition (psychology); Natural language processing; Feature (linguistics); Text segmentation; Feature vector; Support vector machine; Speech recognition; Segmentation; Image (mathematics); Mathematics; Linguistics

Metrics

FWCI (Field Weighted Citation Impact): 0.55

Citation Normalized Percentile: 0.69

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vehicle License Plate Recognition

Physical Sciences → Engineering → Media Technology

Related Documents

Automated System for Arabic Optical Character Recognition with Lookup Dictionary

Omnifont arabic optical character recognition system

A recognition-based Arabic optical character recognition system

Arabic Optical Character Recognition

An Arabic optical character recognition system using recognition-based segmentation