JOURNAL ARTICLE

An End-to-End Scene Text Recognition for Bilingual Text

Bayan M. Albalawi, Amani Jamal, Lama Al Khuzayem, Olaa A. Alsaedi

Year: 2024 Journal: Big Data and Cognitive Computing Vol: 8 (9) Pages: 117-117 Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Text localization and recognition in natural scene images have gained considerable attention recently due to their crucial role in applications such as autonomous driving and intelligent navigation. However, two significant gaps exist in this area: (1) prior research has primarily focused on recognizing English text, whereas Arabic text has been underrepresented, and (2) most prior research has adopted separate approaches for scene text localization and recognition, as opposed to one integrated framework. To address these gaps, we propose a novel bilingual end-to-end approach that localizes and recognizes both Arabic and English text within a single natural scene image. Specifically, our approach utilizes pre-trained CNN models (ResNet and EfficientNetV2) with kernel representation for text localization and RNN models (LSTM and BiLSTM) with an attention mechanism for text recognition. In addition, the AraElectra Arabic language model is incorporated to enhance Arabic text recognition. Experimental results on the EvArest, ICDAR2017, and ICDAR2019 datasets demonstrate that our model achieves superior performance in recognizing not only horizontally oriented text but also multi-oriented and curved Arabic and English text in natural scene images.
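
The abstract does not say which form of attention the recognition stage uses. As a rough illustration only, the sketch below shows plain dot-product attention over a sequence of encoder feature vectors, such as a BiLSTM might produce. The function names `softmax` and `attend` and the toy values are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(decoder_state, encoder_features):
    """Dot-product attention (an assumed form, not the paper's).

    Scores each encoder feature by its dot product with the current
    decoder state, normalizes the scores with softmax, and returns the
    weights together with the weighted sum (the context vector).
    """
    scores = [
        sum(q * k for q, k in zip(decoder_state, feat))
        for feat in encoder_features
    ]
    weights = softmax(scores)
    dim = len(encoder_features[0])
    context = [
        sum(w * feat[i] for w, feat in zip(weights, encoder_features))
        for i in range(dim)
    ]
    return weights, context


# Toy input: three 2-dimensional feature vectors and one decoder state.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
state = [2.0, 0.0]
weights, context = attend(state, features)
```

In a recognizer of this kind, the context vector would typically be combined with the decoder state to predict the next character, so each output step can focus on a different region of the text image.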

Keywords:
Computer science; End-to-end principle; Natural language processing; Artificial intelligence; Information retrieval; Speech recognition

Metrics

Cited By: 3
FWCI (Field Weighted Citation Impact): 1.59
Refs: 98
Citation Normalized Percentile: 0.75

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Transformer-based end-to-end scene text recognition

Xinghao Zhu, Zhi Zhang

Year: 2021 Vol: 19 Pages: 1691-1695

BOOK-CHAPTER

End-to-End Scene Text Recognition Network with Adaptable Text Rectification

Yi Zhang, Zhiwen Li, Lei Guo, Wenbi Rao

Lecture Notes on Data Engineering and Communications Technologies Year: 2021 Pages: 175-184

BOOK-CHAPTER

End-to-End Scene Text Recognition System for Devanagari and Bengali Text

Prithwish Sen, Anindita Das, Nilkanta Sahu

Lecture Notes in Networks and Systems Year: 2021 Pages: 352-359