Multilingual Translational Optical Character Recognition System for Printed Telugu Text

Baratam Vijaya Sai Abhishek; K. Yamuna; T Anjali

doi:10.1109/icccnt51525.2021.9579619

ScienceGate Book Chapters

JOURNAL ARTICLE

Multilingual Translational Optical Character Recognition System for Printed Telugu Text

Baratam Vijaya Sai Abhishek K. Yamuna T Anjali

Year: 2021 Pages: 1-5

DOI: 10.1109/icccnt51525.2021.9579619

Get Full-Text PDF Get Analytical Report

Abstract

OCR, an acronym for "Optical Character Recognition" is a system that automatically grabs information one needs from scanned images of typewritten or printed text by translating them into machine-encoded text. OCR today is embedded in many applications, websites, etc., but most of these systems operate for Latin-based scripts such as Roman and English. India is a multilingual country with more than 19,500 languages or dialects spoken as mother tongues. Due to this diversity, many works are not reported in Indian languages. Most of the Indian language has large character sets that are complex in structure compared to Latin-based scripts. Transfer learning of Latin-based OCR systems to Telugu is hence a difficult undertaking. Neural networks are best equipped to meet the difficulty of Telugu OCR. This work aims to develop a multilingual translation OCR system that can recognize the basic printed texts of Telugu scripts.

Keywords:

Telugu Computer science Scripting language Optical character recognition Natural language processing Artificial intelligence Character (mathematics) Tamil Acronym Sphinx Romanization Linguistics Image (mathematics) History

Metrics

Cited By

0.10

FWCI (Field Weighted Citation Impact)

Refs

0.41

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vehicle License Plate Recognition

Physical Sciences → Engineering → Media Technology

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multilingual Translational Optical Character Recognition System for Printed Telugu Text

Abstract

Metrics

Citation History

Topics

Related Documents

An optical character recognition system for printed Telugu text

Optical Character Recognition for Handwritten Telugu Text

Optical Character Recognition of Telugu Text Using Inception Model

Optical character recognition of arabic printed text

Devanagari optical character recognition of printed text