JOURNAL ARTICLE

Automated Text-to-Audio Conversion for Visually Impaired People Using Optical Character Recognition

Abstract

This work aims to get text from images and documents like Portable Document Format (PDF) and PowerPoint Presentation (PPT) using Optical Character Recognition (OCR). The text is turned into speech, and thus, audio files are received. Organizing these audio files in a specific folder makes it easier to find and listen to them. The work plan is to create a tool that can take documents, PDFs, or PPT files as input and extract letters and numbers from them. This tool is great for quickly entering data from printed documents. Many images are used as input for the tool, which uses a machine to find patterns in the images and extract characters. Python is the main tool used for this work. A Python wrapper for Tesseract is used to test OCR on images first to make sure it works well. Then, the solution is used with a live video feed from a smartphone, processed with OpenCV. The text obtained is then turned into speech using Google Text-To-Speech (gTTS). With this approach, the system can read any text it finds out loud. By combining image processing, OCR, and text-to-speech, the system aims to make it easy and enjoyable to listen to text.

Keywords:
Visually impaired Character (mathematics) Computer science Optical character recognition Speech recognition Character recognition Audio visual Artificial intelligence Natural language processing Computer vision Human–computer interaction Multimedia Image (mathematics) Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.18
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Subtitles and Audiovisual Media
Social Sciences →  Arts and Humanities →  Language and Linguistics

Related Documents

JOURNAL ARTICLE

Text to Speech Conversion using Optical character Recognition for Visually Impaired Persons

Prince sainiRajesh Mehra

Journal:   International Journal of Computer Trends and Technology Year: 2015 Vol: 29 (2)Pages: 97-102
JOURNAL ARTICLE

Effective Shopping Method for Visually Impaired People using Optical Character Recognition

S. MeeraR. Sharmikha SreeDr . K . Valarmathi

Journal:   International Journal of Engineering and Advanced Technology Year: 2019 Vol: 9 (1)Pages: 5304-5306
JOURNAL ARTICLE

Image Text to Speech Conversion Using Optical Character Recognition

S. Priyadharshini

Journal:   International Journal of Psychosocial Rehabilitation Year: 2020 Vol: 24 (5)Pages: 4199-4205
© 2026 ScienceGate Book Chapters — All rights reserved.